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(54) Title: CLASSIFICATION OF BREAST CANCER PATIENTS USING A COMBINATION OF CLINICAL CRITERIA AND 
INFORMATIVE GENESETS 

(57) Abstract: The present invention provides prognostic methods for conditions such as cancer, for example, breast cancer, com- 
prising classifying an individual by a plurality of phenotypic, genotypic or clinical characteristics of the condition into a plurality of 
patient subsets, and analyzing the pattern of expression of prognosis-informative genes identified for that subset in a sample from 
the individual. The present invention also provides methods for constructing such patient subsets and of identifying prognosis-in- 
formative genesets for such subsets. The invention further provides methods of assigning a therapeutic regimen to an individual, 
microarrays useful for performing prognosis, kits comprising these microarrays, and computer systems and programs for implement- 
ing the methods of the invention. 
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CLASSIFICATION OF BREAST CANCER PATIENTS USING A 
COMBINATION OF CLINICAL CRITERIA AND INFORMATIVE GENESETS 



[0001] This application claims the benefit under 35 U.S.C. § 1 19(e) of U.S. Provisional 
Patent Application No. 60/650,401, filed on February 4 5 2 0 05, U.S. Provisional Patent 
Application No. 60/604,076, filed on August 24, 2004, and U.S. Provisional Patent 
Application No. 60/550,810, filed on March 5, 2004, each of which is incorporated by 
reference herein in its entirety. 

1 . FIELD OF THE INVENTION 

[0002] The present invention relates to the use of both phenotypic and genotypic aspects of 
a condition, such as a disease, in order to identify discrete subsets of patients for which 
specific sets of informative genes are then identified. The invention also relates to the 
classification of individuals, such as breast cancer patients, into a subset of the condition on 
the basis of clinical parameters and the status of markers, for example, of genes expression 
patterns, and the prognosis of those individuals on the basis of markers informative for 
prognosis within the subset of the condition. The invention also relates to methods of 
determining a course of treatment or therapy to an individual having, or suspected of having, 
a condition, such as breast cancer. The invention further relates to methods of structuring a 
clinical trial, particularly using five breast cancer-specific patient subsets and prognosis- 
informative genes for each, and of identifying patient populations for clinical trials or for 
other condition-related, for example, breast cancer-related, research. Finally, the invention 
relates to computer implementations of the above methods. 

2. BACKGROUND OF THE INVENTION 

[0003] The increased number of cancer cases reported in the United States, and, indeed, 
around the world, is a major concern. Currently there are only a handful of treatments 
available for specific types of cancer, and these provide no guarantee of success. In order to 
be most effective, these treatments require not only an early detection of the malignancy, but 
a reliable assessment of the severity of the malignancy. 

[0004] The incidence of breast cancer, a leading cause of death in women, has been 
gradually increasing in the United States over the last thirty years. Its cumulative risk is 
relatively high; 1 in 8 women are expected to develop some type of breast cancer by age 85 in 
the United States. In fact, breast cancer is the most common cancer in women and the second 
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most common cause of cancer death in the United States. In 1997, it was estimated that 
181,000 new cases were reported in the U.S., and that 44,000 people would die of breast 
cancer (Parker et al, CA Cancer J. Clin. 47:5-27(1997); Chu et ah 9 J. Nat Cancer Inst 
88: 1 571-1 579 (1996)). While mechanism of tumorigenesis for most breast carcinomas is 
largely unknown, there are genetic factors that can predispose some women to developing 
breast cancer (Miki et al, Science, 266:66-71(1994)). 

[0005] Sporadic tumors, those not currently associated with a known germline mutation, 
constitute the majority of breast cancers. It is also likely that other, non-genetic factors also 
have a significant effect on the etiology of the disease. Regardless of the cancer's origin, 
breast cancer morbidity and mortality increases significantly if it is not detected early in its 
progression. Thus, considerable effort has focused on the early detection of cellular 
transformation and tumor formation in breast tissue. 

[0006] A marker-based approach to tumor identification and characterization promises 
improved diagnostic and prognostic reliability. Typically, the diagnosis of breast cancer 
requires histopathological proof of the presence of the tumor. In addition to diagnosis, 
histopathological examinations also provide information about prognosis and selection of 
treatment regimens. Prognosis may also be established based upon clinical parameters such 
as tumor size, tumor grade, the age of the patient, and lymph node metastasis. 
[0007] Diagnosis and/or prognosis may be determined to varying degrees of effectiveness 
by direct examination of the outside of the breast, or through mammography or other X-ray 
imaging methods (Jatoi, Am. J. Surg. 177:518-524 (1999)). The latter approach is not 
without considerable cost, however. Every time a mammogram is taken, the patient incurs a 
small risk of having a breast tumor induced by the ionizing properties of the radiation used 
during the test. In addition, the process is expensive and the subjective interpretations of a 
technician can lead to imprecision. For example, one study showed major clinical 
disagreements for about one-third of a set of mammograms that were interpreted individually 
by a surveyed group of radiologists. Moreover, many women find that undergoing a 
mammogram is a painful experience. Accordingly, the National Cancer Institute has not 
recommended mammograms for women under fifty years of age, since this group is not as 
likely to develop breast cancers as are older women. It is compelling to note, however, that 
while only about 22% of breast cancers occur in women under fifty, data suggests that breast 
cancer is more aggressive in pre-menopausal women. 

[0008] In clinical practice, accurate diagnosis of various subtypes of breast cancer is 
important because treatment options, prognosis, and the likelihood of therapeutic response all 
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vary broadly depending on the diagnosis. Accurate prognosis, or determination of distant 
metastasis-free survival could allow the oncologist to tailor the administration of adjuvant 
chemotherapy, with women having poorer prognoses being given the most aggressive 
treatment. Furthermore, accurate prediction of poor prognosis would greatly impact clinical 
trials for new breast cancer therapies, because potential study patients could then be stratified 
according to prognosis. Trials could then be limited to patients having poor prognosis, in 
turn making it easier to discern if an experimental therapy is efficacious. 
[0009] To date, no set of satisfactory predictors for prognosis based on the clinical 
information alone has been identified. Many have observed that the ER status has a dominant 
signature in the breast tumor gene expression profiling. See West et al, Proc. Natl Acad, 
Scl U.S.A. 98:1 1462 (2001); van 't Veeretal, Nature 415:530 (2002); Sorlie et al, Proc. 
Natl Acad. Scl U.S.A. 100:8418 (2003); Perou et al, Nature 406:747 (2000); Gruvberger et 
al, Cancer Res. 61:5979 (2001); Sotiriou et al, Proc. Natl Acad. Scl U.S.A. 100:10393 
(2003). It is generally accepted that there is some relationship between patient survival and 
ER status, van de Vijver et al,N Engl J. Med. 347:1999 (2002); Surowiak et al, Folia 
Histochem. Cytobiol 39:143 (2001); Pichon et al, Br. J. Cancer 73:1545 (1996); Collett et 
al, J- Clin. Pathol. 49:920 (1996). BRCA1 mutations are related to the familial cancer 
susceptibility. Biesecker et al, JAMA 269:1970 (1993); Easton et al, Cancer Surv. 18:95 
(1993). Age is also considered to be a prognosis factor since young cancer patients tend to 
have poor tumors. Maggard et al , J. Surg. Res. 113:109 (2003). Lymph node status is a 
factor in deciding the treatment. Eifel et ah, J. Natl Cancer Inst 93:979 (2001). 
[0010] The discovery and characterization of BRCAl and BRCA2 has recently expanded 
our knowledge of genetic factors which can contribute to familial breast cancer. Germ-line 
mutations within these two loci are associated with a 50 to 85% lifetime risk of breast and/or 
ovarian cancer (Casey, Curr. Opin. Oncol 9:88-93 (1997); Marcus et al, Cancer 77:697- 
709 (1996)). Only about 5% to 10% of breast cancers, however, are associated with breast 
cancer susceptibility genes, BRCAl and BRCA2. The cumulative lifetime risk of breast 
cancer for women who carry the mutant BRCAl is predicted to be approximately 92%, while 
the cumulative lifetime risk for the non-carrier majority is estimated to be approximately 
10%. BRCAl is a tumor suppressor gene that is involved in DNA repair and cell cycle 
control, which are both important for the maintenance of genomic stability. More than 90% 
of all mutations reported so far result in a premature truncation of the protein product with 
abnormal or abolished function. The histology of breast cancer in BRCAl mutation carriers 
differs from that in sporadic cases, but mutation analysis is the only way to find the carrier* 
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Like BRCA1, BRCA2 is involved in the development of breast cancer, and like BRCA1 plays 
a role in DNA repair. However, unlike BRCA1, it is not involved in ovarian cancer. 
[001 1] Other genes have been linked to breast cancer, for example c-erb-2 (HER2) and p53 
(Beenken et al, Ann. Surg. 233(5):630-638 (2001). Overexpression of c-erb-2 (HER2) and 
p53 have been correlated with poor prognosis (Rudolph et al~, Hum. Pathol 32(3):31 1-3 19 
(2001), as has been aberrant expression products of mdm2 (Lxikas et al, Cancer Res. 
61(7):3212-3219 (2001) and cyclinl and p27 (Porter & Roberts, International Publication 
WO98/33450, published August 6, 1998). 

[0012] The detection of BRCA1 or BRCA2 mutations represents a step towards the design of 
therapies to better control and prevent the appearance of these tumors. Recently, many 
studies have used gene expression profiling to analyze various cancers, and those studies 
have provided new diagnosis and prognosis information in the molecular level. See 
Zajchowski et ah, 'Identification of Gene Expression Profiled that Predict the Aggressive 
Behavior of Breast Cancer Cells," Cancer Res. 61:5168 (2001); West et al, "Predicting the 
Clinical Status of Human Breast Cancer by Using Gene Expression Profiles," Proc. Natl 
Acad. Set U.S.A. 98:1 1462 (2001); van 't Veer et al, "Gene Expression Profiling Predicts the 
Outcome of Breast Cancer," Nature 415:530 (2002); Roberts *t al., "Diagnosis and Prognosis 
of Breast Cancer Patients," WO 02/103320; Sorlie et al.,Proc?. Natl Acad. Set U.S.A. 
100:8418 (2003); Perou etal, Nature 406:747 (2000); Khan &t al, Cancer Res 58, 5009 
(1998); Golub et al, Science 286, 531 (1999); DeRisi et al, hTat. Genet. 14:457 (1996); 
Alizadeh et al, Nature 403, 503 (2000). Methods for the identification of informative 
genesets for various cancers have also been described. See Roberts et al, "Diagnosis and 
Prognosis of Breast Cancer Patients," WO 02/103320; Golub &t al, United States Patent No, 
6,647,341. 

[0013] Genesets have been identified that are informative for- differentiating individuals 
having, or suspected of having, breast cancer based on estrogexi receptor (ER) status, or 
BRCA1 mutation vs. sporadic (i.e., other than BRCAl-type) mutation status. See Roberts et 
al, WO 02/103320; van't Veer et al, Nature 415:530 (2001). Genesets have also been 
identified that enable the classification of sporadic tumor-type individuals as those who will 
likely have no metastases within five years of initial diagnosis (i.e., individuals with a good 
prognosis) or those who will likely have a metastasis within five years of initial diagnosis 
(i.e., those having a poor prognosis). Roberts, supra; van't Veer, supra. 
[0014] Roberts et al WO 02/103320 describes a 70-gene set, useful for the prognosis of 
breast cancer, which outperformed clinical measures of prognosis, and which showed good 
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potential in selecting good outcome patients, thereby avoiding over-treatment, van de Vijver 
et al 9 N. Engl J. Med 347:1999 (2002). The expression of genes with most predictive value, 
however, were not homogeneous among poor patients, suggesting the need for improvement. 
[0015] Although the patterns of gene expression as described in Roberts et ah were 
correlated with existing clinical indicators such as estrogen receptor and BRCA1 status, 
clinical measures were not incorporated. Furthermore, although the poor-outcome group in 
particular showed heterogeneity in expression pattern, the? best classifier decision rule found 
during these studies was a fairly simple one based on the similarity of a patient profile to the 
average profile of a good-outcome training group. 

[0016] It is evident that breast cancer is the result of more than one type of molecular event. 
Likewise, a variety of other conditions, such as other cancers; non-cancer diseases such as 
diabetes, autoimmune or neurodegenerative disorders, obesity; etc., are also the result of 
more than one molecular event. Moreover, an individual 9 s response to exposure to particular 
environmental conditions, for example, exposure to natural or man-made agents, such as 
toxins, pollutants, drugs, food additives, etc., likely result from more than one molecular 
event. Thus, there exists a need for improved prognostic methods so that appropriate courses 
of prophylaxis and/or therapy may be provided. Genesets having improved prognostic power 
can be identified by first identifying discrete subsets of individuals based on genotypic or 
phenotypic characteristics relevant to the disease or condition, and then identifying genesets 
informative for prognosis within those subsets of patients. Individuals having the condition, 
or who are suspected of having the condition, such as breast cancer, would then be provided 
therapies appropriate to the molecular mechanisms underlying the condition. The present 
invention provides such methods for breast cancer, and for- other cancers, diseases or 
conditions. 

3. SUMMARY OF THE INVENTION 

[0017] The present invention provides methods of identifying relevant subsets of 
conditions, and the identification of markers relevant to those subsets, for example, for 
prognosis of individuals classifiable into one of those subsets. The invention further provides 
sets of markers useful for the prognosis of individuals havLng breast cancer, wherein those 
patients have been classified according to one or more chairacteristics of breast cancer. 
[001 8] Thus, the present invention provides a method of identifying a set of informative 
genes or markers for a condition comprising a plurality of phenotypic or genotypic 
characteristics, comprising: (a) classifying each of a plurality of samples or individuals on the 
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basis of one or more phenotypic or genotypic characteristics of said condition into a plurality 
of first classes; and (b) identifying within each of said fiirst classes a first set of genes or 
markers informative for said condition, wherein said firsi: set of genes or markers within each 
of said first classes is unique to said class relative to other first classes. In a specific 
embodiment, this method further comprises additionally classifying into a plurality of second 
classes said samples or individuals in at least one of said first classes on the basis of a 
phenotypic or genotypic characteristic different that that used in said classifying step (a); and 
identifying within at least one of said second classes a second set of informative genes or 
markers, wherein said second set of informative genes or markers within each of said second 
classes is unique to said second class relative to other first and second classes. 
[0019] The invention further provides a method of identifying a set of informative genes or 
markers for a condition comprising a plurality of phenotypic or genotypic characteristics, 
comprising: (a) classifying each of a plurality of samples or individuals on the basis of one or 
more phenotypic or genotypic characteristics into a plurality of first classes; (b) classifying at 
least one of said first classes into a plurality of second classes on the basis of phenotypic or 
genotypic characteristic different than that used in said classifying step (a); and (c) 
identifying within at least one of said first classes or said second classes a set of genes or 
markers informative for said condition, wherein said second set of genes or markers is unique 
to said class relative to other first and second classes. 

[0020] The invention further provides a method of identifying a set of informative genes or 
markers for a condition comprising a plurality of phenotypic or genotypic characteristics, 
comprising: (a) selecting a first characteristic from said pLiirality of phenotypic or genotypic 
characteristics; (b) identifying at least two first condition classes differentiable by said first 
characteristic; (c) selecting a plurality of individuals classifiable into at least one of said first 
condition classes; and (d) identifying in samples derived firom each of said plurality of 
individuals a set of genes or markers informative for said condition within said at least one of 
said first condition classes. 

[0021] The invention further provides a method of classifying an individual with a 
condition as having a good prognosis or a poor prognosis, comprising: (a) classifying said 
individual into one of a plurality of patient classes, said palient classes being differentiated by 
one or more phenotypic, genotypic or clinical characteristics of said condition; (b) 
determining the level of expression of a plurality of genes or their encoded proteins in a cell 
sample taken from the individual relative to a control, said plurality of genes or their encoded 
proteins comprising genes or their encoded proteins informative for prognosis of the patient 
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class into which said individual is classified; and (c) classifying said individual as having a 
good prognosis or a poor prognosis on the basis of said level of expression. In a specific 
embodiment, said condition is cancer, said good prognosis is the non-occurrence of 
metastases within five years of initial diagnosis, and said poor prognosis is the occurrence of 
metastases within five years of initial diagnosis. In a more specific embodiment, said cancer 
is breast cancer. In another specific embodiment, said control is the average level of 
expression of each of said plurality of genes or their encoded proteins across a plurality of 
samples derived from individuals identified as having a poor prognosis. In a more specific 
embodiment, said classifying step (c) is earned out by a method comprising comparing the 
level of expression of each of said plurality of genes or ttaeir encoded proteins to said average 
level of expression of each corresponding gene or its encoded protein in said control, and 
classifying said individual as having a poor prognosis if said level of expression correlates 
with said average level of expression of each of said genes or their encoded proteins in said 
control more strongly than would be expected by chance. In another specific embodiment, 
said control is the average level of expression of each of said plurality of genes or their 
encoded proteins across a plurality of samples derived from individuals identified as having a 
good prognosis. In a more specific embodiment, said classifying in step (c) is carried out by 
a method comprising comparing the level expression of each of said plurality of genes or 
their encoded proteins to said average level of expression of each corresponding gene or its 
encoded protein in said control, and classifying said individual as having a good prognosis if 
said level of expression correlates with said average level of expression of each of said genes 
or their encoded proteins in said control more strongly than would be expected by chance. In 
another specific embodiment, said plurality of patient classes comprises ER~~, BRCAI 
individuals; ER~, sporadic individuals; ER+, ER/AGE high individuals; ER+, ER/AGE low, 
LN+ individuals; and ER+, ER/AGE low, LN~ individuals. 

[0022] The invention further provides a method of classifying a breast cancer patient as 
having a good prognosis or a poor prognosis comprising: <a) classifying said breast cancer 
patient as ER~~, BRCAI; ER~ sporadic; ER+, ER/AGE high; ER+, ER/AGE low, LN+; or 
ER+, ER/AGE low, LN~; (b) determining the level of expression of a first plurality of genes 
in a cell sample taken from said breast cancer patient relative to a control, said first plurality 
of genes comprising two of the genes corresponding to the markers in Table 1 if said breast 
cancer patient is classified as ER~ BRCAI; in Table 2 if said breast cancer patient is 
classified as ER~ sporadic; in Table 3 if said breast cancer patient is classified as ER+, 
ER/AGE high; in Table 4 if said breast cancer patient is classified as ER+. ER/AGE low, 
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LN+; or in Table 5 if said breast cancer patient is classified els ER+, ER/AGE low, LN~~; and 
(c) classifying said breast cancer patient as having a good prognosis or a poor prognosis on 
the basis of the level of expression of said first plurality of genes, wherein said breast cancer 
patient is "ER/AGE high" if the ratio of the logio(ratio) of ER gene expression to age exceeds 
a predetermined value, and "ER/AGE low" if the ratio of the logi 0 (ratio) of ER gene 
expression to age does not exceed said predetermined value. In a specific embodiment, said 
control is the average level of expression of each of said plurality of genes in a plurality of 
samples derived from ER", BRCAl individuals, if said breast cancer patient is ER~, BRCA1; 
the average level of expression of each of said plurality of genes in a plurality of samples 
derived from ER~, sporadic individuals if said breast cancer patient is ER~, sporadic; the 
average level of expression of each of said plurality of genes in a plurality of samples derived 
from ER+, ER/AGE high individuals, if said breast cancer patient is ER+, ER/AGE high; the 
average level of expression of each of said plurality of genes in a plurality of samples derived 
from ER+, ER/AGE low, LN+ individuals where said breast cancer patient is ER+, ER/AGE 
low, LN+; or the average level of expression of each of said plurality of genes in a plurality 
of samples derived from ER+, ER/AGE low, LIST individuals where said breast cancer patient 
is ER+, ER/AGE low, LIST. In a more specific embodiment, each of said individuals has a 
poor prognosis. In another more specific embodiment, each of said individuals has a good 
prognosis. In an even more specific embodiment, said classifying step (c) is carried out by a 
method comprising comparing the level of expression of each, of said plurality of genes or 
their encoded proteins in a sample from said breast cancer patient to said control, and 
classifying said breast cancer patient as having a poor prognosis if said level of expression 
correlates with said average level of expression of the corresponding genes or their encoded 
proteins in said control more strongly than would be expected by chance. In another specific 
embodiment, said predetermined value of ER is calculated as ER = 0.1 (AGE ~ 42.5), wherein 
AGE is the age of said individual. In another specific embodiment, said individual is ER~ 
BRCAl, and said plurality of genes comprises two of the genes for which markers are listed 
in Table 1. In another specific embodiment, said individual is ER", BRCAl, and said 
plurality of genes comprises all of the genes for which markers are listed in Table 1 . In 
another specific embodiment, said individual is ER~, sporadic,, and said plurality of genes 
comprises two of the genes for which markers are listed in Table 2. said individual is ER , 
sporadic, and said plurality of genes comprises all of the genes for which markers are listed in 
Table 2. In another specific embodiment, said individual is ER+, ER/AGE high, and said 
plurality of genes comprises two of the genes for which markers are listed in Table 3. said 
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individual is ER+, ER/AGE high, and said plurality of geoies comprises all of the genes for 
which markers are listed in Table 3. In another specific embodiment, said individual is ER+, 
ER/AGE low, LN+, and said plurality of genes comprises two of the genes for which markers 
are listed in Table 4. In another specific embodiment, said individual is ER+, ER/AGE low, 
LN+, and said plurality of genes comprises all of the genes for which markers are listed in 
Table 4. In another specific embodiment, said individual is ER+, ER/AGE low, LIST, and 
said plurality of genes comprises two of the genes for which markers are listed in Table 4. In 
another specific embodiment, said individual is ER+, ER/AGE low, LN", and said plurality of 
genes comprises all of the genes for which markers are listed in Table 4. In another specific 
embodiment, the method further comprises determining in said cell sample the level of 
expression, relative to a control, of a second plurality of genes for which markers are not 
found in Tables 1-5, wherein said second plurality of genes is informative for prognosis. 
[0023] In another embodiment, the invention provides a method for assigning an individual 
to one of a plurality of categories in a clinical trial, comprising: (a) classifying said individual 
as ER~ BRCAU ER~, sporadic; ER+, ER/AGE high; ER-l-^ ER/AGE low, LN+; or ER+, 
ER/AGE low, LIST; (b) determining for said individual the? level of expression of at least two 
genes for which markers are listed in Table 1 if said individual is classified as ER~, BRCAJ; 
Table 2 if said individual is classified as ER~, sporadic; Table 3 if said individual is classified 
as ER+, ER/AGE high; Table 4 if said individual is classified as ER+, ER/AGE low, LN+; or 
Table 5 if said individual is classified as ER+, ER/AGE low, LIST; (c) determining whether 
said individual has a pattern of expression of said at least two genes that correlates with a 
good prognosis or a poor prognosis; and (d) assigning said individual to one category in a 
clinical trial if said individual has a good prognosis, and assigning said individual to a second 
category in said clinical trial if said individual has a poor prognosis. In a specific 
embodiment, said individual is additionally assigned to a category in said clinical trial on the 
basis of the classification of said individual as determined in step (a). In another specific 
embodiment, said individual is additionally assigned to a category in said clinical trial on the 
basis of any other clinical, phenotypic or genotypic characteristic of breast cancer. In another 
specific embodiment, said method further comprises determining in said cell sample the level 
of expression, relative to a control, of a second plurality of genes for which markers are not 
found in Tables 1-5, wherein said second plurality of genes is informative for prognosis of 
breast cancer, and determining from the expression of said second plurality of genes, in 
addition to said first plurality of genes, whether said individual has a good prognosis or a 
poor prognosis. 
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[0024] The invention further provides a microarray comprising probes complementary and 
hybridizable to a plurality of the genes for which markers are listed in amy of Tables 1-5. The 
invention further provides a microarray comprising probes complementary and hybridizable 
to a plurality of the genes for which markers are listed in Table 1, each of the genes for which 
markers are listed in Table 1, a plurality of the genes for which markers are listed in Table 2, 
each of the genes for which markers are listed in Table 2, a plurality of the genes for which 
markers are listed in Table 3, each of the genes for which markers are listed in Table 3, a 
plurality of the genes for which markers are listed in Table 4, each of the genes for which 
markers are listed in Table 4 ? a plurality of the genes for which markers are listed in Table 5, 
or each of the genes for which markers are listed in Table 5. The invention further provides 
any one of the above microarrays, wherein said probes are at least 50% of the probes on said 
microarray. The invention further provides any one of the above microarray s, wherein said 
probes are at least 90% of the probes on said microarray. The invention jfurther provides 
microarray comprising probes complementary and hybridizable to a plurality of the genes for 
which markers are listed in any of Tables 1-5, wherein said probes are complementary and 
hybridizable to at least 75% of the genes for which markers are listed in Table 1 ; are 
complementary and hybridizable to at least 75% of the genes for which markers are listed in 
Table 2; are complementary and hybridizable to at least 75% of the genes for which markers 
are listed in Table 3; are complementary and hybridizable to at least 75% of the genes for 
which markers are listed in Table 4; and are complementary and hybridizable to at least 75% 
of the genes for which markers are listed in Table 5, wherein said probes* in total, are at least 
50% of the probes on said microarray. 

[0025] The invention further comprises a kit comprising any one of the above microarrays 
in a sealed container. 

[0026] The invention further provides a method of identifying a set of genes informative for 
a condition, said condition having a plurality of phenotypic or genotypic characteristics such 
that samples may be categorized by at least one of said phenotypic or genotypic 
characteristics into at least one characteristic class, said method comprising: (a) selecting a 
plurality of samples from individuals having said condition; (b) identifying a first set of genes 
informative for said- characteristic class using said plurality of samples; (c) predicting the 
characteristic class of each of said plurality of samples; (d) discarding samples for which said 
characteristic class is incorrectly predicted; (e) repeating steps (c) and (d) at least once; and 
(f) identifying a second set of genes informative for said characteristic class using samples in 
said plurality of samples remaining after step (e). 
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[0027] The invention further provides a method for assigning an individual to one of a 
plurality of categories in a clinical trial, comprising: (a) classifying the individual into one of 
a plurality of condition categories differentiated by at least one genotypic or phen_otypic 
characteristic of the condition; (b) determining the level of expression, in a sample derived 
from said individual, of a plurality of genes informative for said condition categoxy; (c) 
determining whether said level of expression of said plurality of genes indicates tliat the 
individual has a good prognosis or a poor prognosis; and (d) assigning the individual to a 
category in a clinical trial on the basis of prognosis. 

[0028] The invention also provides a method for identifying one or more sets of~ informative 
genes or markers for a condition in an organism, comprising: (a) subdividing a plxirality of 
individuals or samples derived therefrom of the organism subject to the condition into a 
plurality of classes based on one or more clinical, phenotypic or genotypic characteristics of 
the organism, wherein each class consists of a plurality of individuals or samples derived 
therefrom of the organism each of which having one or more clinical, phenotypic or 
genotypic characteristics specific for the class; and (b) attempting to identify for each of one 
or more of said plurality of classes a set of genes or markers informative for said condition in 
individuals in said class, wherein, if a set of genes or markers informative for said condition 
in individuals in said class is obtained for any of said one or more of said plurality of classes, 
said set of genes or markers is taken as a set of informative genes or markers for said 
condition in said organism. 

[0029] In one embodiment, the method further comprises, for each of one or more of said 
classes in which a set of genes or markers informative for said condition in individuals in said 
class cannot be obtained, repeating said steps (a) and (b) on said plurality of individuals or 
samples derived therefrom in said class such that said plurality of individuals or samples 
derived therefrom in said class is subdivided into a plurality of additional classes "based on 
one or more clinical, phenotypic or genotypic characteristics of said organism whdch are 
different from those used for defining said class, wherein for each of said plurality of 
additional classes, if a set of genes or markers informative for said condition in individuals in 
said class is obtained, said set of genes or markers is taken as a set of informative genes or 
markers for said condition in said organism. 

[0030] The invention also provides a method for identifying one or more sets or informative 
genes or markers for a condition in an organism, comprising: (a) subdividing a plurality of 
individuals or samples derived therefrom of said organism subject to said condition into a 
plurality of classes based on one or more clinical, phenotypic or genotypic characteristics of 
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said organism, wherein each said class consists of a plurality of individuals or samples 
derived therefrom of said organism each having said one or more clinical, phenotypic or 
genotypic characteristics specific for said class; (b) attempting to identify for each of one or 
more of said plurality of classes a set of genes or markers informative for said condition in 
individuals in said class, wherein if a set of genes or markers informative for said condition in 
individuals in said class is identified for any of said one or more of said classes, said set of 
genes or markers is taken as a set of informative genes or markers for a condition in said 
organism; and (c) for each of one or more of said classes in which a set of genes or markers 
informative for said condition in individuals in said class cannot be obtained, repeating said 
steps (a) and (b) on said plurality of individuals or samples derived therefrom in said class 
such that said plurality of samples or individuals in said class is subdivided into a plurality of 
additional classes based on one or more clinical, phenotypic or genotypic characteristics of 
said organism which are different from those used those used for defining said class, wherein 
for each of one or more of said plurality of additional classes, if a set of genes or markers 
informative for said condition in individuals in said class is obtained, said set of genes or 
markers is taken as a set of informative genes or markers for a condition in said organism. 
[0031] In the methods of the invention, the condition can be a type of cancer. In such an 
embodiment, each of said sets of genes or markers can be informative of prognosis of 
individuals in a corresponding class. In one embodiment, the condition is breast cancer, and 
the one or more clinical, phenotypic or genotypic characteristics comprise age, ER level, 
ER/AGE, BRAC1 status, and lymph node status. 

[0032] In one embodiment, the methods of the invention further comprise generating a 
template profile comprising measurements of levels of genes or markers of the set of 
informative genes or markers for said class representative of levels of the genes or markers in 
a plurality of patients having a chosen prognosis level. 

[0033] The invention also provides a method for predicting a breast cancer patient as having 
a good prognosis or a poor prognosis, comprising: (a) classifying said breast cancer patient 
into one of the following classes: (al) ER~ BRCAl\ (a2) ER"", sporadic; (a3) ER+, ER/AGE 
high; (a4) ER+, ER/AGE low, LN+; or (a5) ER+, ER/AGE low, LN~; (b) determining a 
profile comprising measurements of a plurality of genes or markers in a cell sample taken 
from said breast cancer patient, said plurality of genes markers comprising at least two of the 
genes or markers corresponding to the markers in (bl) Table 1 if said breast cancer patient is 
classified as ER - ", BRCAI; (b2) Table 2 if said breast cancer patient is classified as ER - 
sporadic; (b3) Table 3 if said breast cancer patient is classified as ER+, ER/AGE high; (b4) 

12 



WO 2005/086891 



PCT/US2005/007894 



Table 4 if said breast cancer patient is classified as ER+, ER/AGE low, LN+; or (b5) Table 5 
if said breast cancer patient is classified as ER+ 5 ER/AGE low, LN~; and (c) classifying said 
breast cancer patient as having a good prognosis or a poor prognosis based on said profile of 
said plurality of genes or markers, wherein ER + designates a high ER level and ER~ 
designates a low ER level, wherein said ER/A.GE is a metric of said ER level relative to the 
age of said patient, and wherein LN + designates a greater than 0 lymph nodes status in said 
patient and LN" designates a 0 lymph nodes status in said patient. 

[0034] In one embodiment, step (c) is carried out by a method comprising comparing said 
profile to a good prognosis template and/or a poor prognosis template, and wherein said 
patient is classified as having a good prognosis if said profile has a high similarity to a good 
prognosis template or has a low similarity to a poor prognosis template or as having a poor 
prognosis if said profile has a low similarity to a good prognosis template or has a high 
similarity to a poor prognosis template. A good prognosis template comprises measurements 
of said plurality of genes or markers representative of levels of said genes or markers in a 
plurality of good outcome patients, while a poor prognosis template comprises measurements 
of said plurality of genes or markers representative of levels of said genes or markers in a 
plurality of poor outcome patients. Here a good outcome patient is a breast cancer patient 
who has non-reoccurrence of metastases within a first period of time after initial diagnosis, 
while a poor outcome patient is a patient who has reoccurrence of metastases within a second 
period of time after initial diagnosis. 

[0035] In another embodiment, the methods for predicting the prognosis of a breast cancer 
patient further comprise determining said profile, said ER level, said LN status, and/or, said 
ER/AGE. In one embodiment, said profile is em expression profile comprising measurements 
of a plurality of transcripts in a sample derived from said patient, wherein said good 
prognosis template comprises measurements of said plurality of transcripts representative of 
expression levels of said transcripts in said plurality of good outcome patients, and wherein 
said poor prognosis template comprises measiarements of said plurality of transcripts 
representative of expression levels of said trarxscripts in said plurality of poor outcome 
patients. 

[0036] In one embodiment, said expression profile is a differential expression profile 
comprising differential measurements of said plurality of transcripts in said sample derived 
from said patient versus measurements of said plurality of transcripts in a control sample. 
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[0037] In one embodiment, the measurement of each said transcript in said good prognosis 
template is an average of expression levels of said transcript in said plurality of good 
outcome patients. 

[0038] In one embodiment, the similarity of said expression profile to said good or poor 
prognosis template is represented by a correlation coefficient between said expression profile 
and said good or poor prognosis template, respectively, and a correlation coefficient greater 
than a correlation threshold, e.g., 0.5, indicates a high similarity and said correlation 
coefficient equal to or less than said correlation threshold indicates a low similarity. 
[0039] In another embodiment, the similarity of said expression profile to said good or po or 
prognosis template is represented by a distance between said cellular constituent profile and 
said good or poor prognosis template, respectively, and a distance less than a given value 
indicates a high similarity and said distance equal to or greater than said given value indicates 
a low similarity. 

[0040] In another embodiment, said profile comprises measurements of a plurality of 
protein species in a sample derived from said patient, wherein said good prognosis template 
comprises measurements of said plurality of protein species representative of levels of said 
protein species in said plurality of good outcome patients, and wherein said poor prognosis 
template comprises measurements of said plurality of protein species representative of levels 
of said protein species in said plurality of poor outcome patients. 

[0041] In one embodiment, said ER level is determined by measuring an expression level of 
a gene encoding said estrogen receptor, e.g., the estrogen receptor a gene, in said patient 
relative to expression level of said gene in said control sample, and said ER level is classified 
as ER + if loglO(ratio) of said expression level is greater than -0.65, and said ER level is 
classified as ER" if logl0(ratio) of said expression level is equal to or less than -0.65, 
[0042] In one embodiment, said ER/AGE is classified as high if said ER level is greater 
than c-(AGE - d) 9 and said ER/AGE is classified as low if said ER level is equal to or less 
than c { AGE - d), wherein c is a coefficient, AGE is the age of said patient, and d is an age 
threshold. 

[0043] In a specific embodiment, said estrogen receptor level is measured by a 
polynucleotide probe that detects a transcript corresponding to the gene having accession 
number NM_000125, said control sample is a pool of breast cancer cells of different patient s, 
and c = 0.1 and d = 42.5. 
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[0044] In one embodiment, said control sample is generated by pooling together cDNAs of 
said plurality of transcripts from a plurality of breast cancer patients. In another embodiment, 
said control sample is generated by pooling together synthesized cDNAs of said plurality of 
transcripts and said transcript of said gene encoding said estrogen receptor. 
[0045] In one embodiment, said individual is ERT, BRCA1, and said plurality of genes 
comprises at least two of the genes for which markers are listed in Table 1 . In one 
embodiment, said individual is ER~ BRCAU and said plurality of genes comprises all of the 
genes for which markers are listed in Table 1 . 

[0046] In another embodiment, the individual is ERT, sporadic, and said plurality of genes 
comprises at least two of the genes for which markers are listed in Table 2. In one 
embodiment, said individual is ER~, sporadic, and said plurality of genes comprises all of the 
genes for which markers are listed in Table 2. 

[0047] In still another embodiment, said individual is ER+, ER/AGE high, and said plurality 
of genes comprises at least two of the genes for which markers are listed in Table 3. In one 
embodiment, said individual is ER+, ER/AGE high, and said plurality of genes comprises all 
of the genes for which markers are listed in Table 3. 

[0048] In still another embodiment, said individual is ER+, ER/AGE low, LN+, and said 
plurality of genes comprises at least two of the genes for which markers are listed in Table 4. 
In one embodiment, said individual is ER+, ER/AGE low, LN+, and said plurality of genes 
comprises all of the genes for which markers are listed in Table 4. 

[0049] In still another embodiment, said individual is ER+, ER/AGE low, LN~, and said 
plurality of genes comprises at least two of the genes for which markers are listed in Table 4. 
In one embodiment, the individual is ER+, ER/AGE low, LIST, and said plurality of genes 
comprises all of the genes for which markers are listed in Table 4. 

[0050] In one embodiment, said profile further comprises one or more genes for which 
markers are not found in Tables 1-5, which are informative for prognosis. 
[0051] The invention also provides a method for assigning an individual to one of a 
plurality of categories in a clinical trial, comprising assigning said individual to one category 
in a clinical trial if said individual has a good prognosis as determined by any one of the 
methods described above, and assigning said individual to a second category in said clinical 
trial if said individual has a poor prognosis as determined by any one of the methods 
described above. 
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[0052] In one embodiment, said individual is additionally assigned to a category in said 
clinical trial on the basis of the classification of said individual based on said profile, said ER 
level, said LN status, and/or, said ER/AGE. 

[0053] In one embodiment, said individual is additionally assigned to a category in said 
clinical trial on the basis of one or more other clinical, phenotypic or genotypic characteristic 
of breast cancer. 

[0054] In one embodiment, the method further comprises determining in said cell sample 
the levels of expression of said one or more genes for which markers are not found in Tables 
1-5, and determining from said expression levels of said one or more genes, whether said 
individual has a good prognosis or a poor prognosis. 

4. BRIEF DESCRIPTION OF THE DRAWINGS 

[0055] FIG. 1 depicts the decision tree that resulted in the five patient subsets used to 
identify informative prognosis-related genes. 

[0056] FIG. 2: Relationship between ER level and age. (A) Scatter plot of ER vs. age for 
ER+ patients. Black dots indicate metastases free samples, and gray dots indicate metastases 
samples. It appears that patients of ER+ group can be subdivided into "ER+, ER/AGE high" 
group (above the black line) and "ER+, ER/AGE low" (below the black line) group. The 
black line is approximated by ER = 0.1*(AGE-42.5), and the dashed line by ER = 0.1*(age- 
50). Within each population, the ER level also increases with age. (B) Age distribution of all 
patients in ER+ samples. A bimodal distribution is observed. (C) ER-modulated age (age - 
10*) distribution of all patients in ER+ samples. A bimodal distribution is observed. (D) 
Age distribution of samples with metastasis. (E) ER-modulated age distribution of samples 
with metastasis. The three peaks appearing in this distribution suggest a polymorphism. 
[0057] FIG. 3. Performance of classifier for the "ER7sporadic" group. (A) Error rate 
obtained from leave-one-out cross validation (LOOCV) for predicting the disease outcome as 
a function of the number of reporter genes used in the classifier. (B) Scatter plot between 
correlation to good group (X axis) and to poor group (Y axis). Circles indicate metastases- 
free samples, squares indicate samples with metastases. Dashed line: threshold for separating 
poor from good. (C) Error rate calculated with respect to good outcome group (good 
outcome misclassified as poor divided by total number of good), or poor outcome group 
(poor outcome misclassified as good divided by total number of poor), or the average of the 
two rates. 
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[0058] FIG. 4. Performance of classifier for the "ER+, ER/AGE high" group. (A) Error 
rate obtained from leave-one-out cross validation (LOOCV) for predicting the disease 
outcome as a function of the number of reporter genes used in the classifier. (B) Scatter plot 
between correlation to good group (X axis) and to poor group (Y axis). Circles indicate 
metastases-free samples, and squares indicate samples with metastases. Dashed line: 
threshold for separating poor from good. (C) Error rate calculated with respect to good 
outcome group (good outcome misclassified as poor divided by total number of good), or 
poor outcome group (poor outcome misclassified as good divided by total number of poor), 
or the average of the two rates. 

[0059] FIG. 5. Performance of classifier for the "ER+, ER/AGE low/LN~ 9 group. (A) 
Error rate obtained from leave-one-out cross validation (LOOCV) for predicting the disease 
outcome as a function of the number of reporter genes used in the classifier. (B) Scatter plot 
between correlation to good group (X axis) and to poor group (Y axis). Circles indicate 
metastases-free samples, and squares indicates samples with metastases. Dashed line 
indicates the threshold for separating poor from good. (C) Error rate calculated with respect 
to good outcome group (good outcome misclassified as poor divided by total number of 
good), or poor outcome group (poor outcome misclassified as good divided by total number 
of poor), or the average of the two rates. 

[0060] FIG. 6. Performance of classifier for the "ER+, ER/AGE low/LN+" group. (A) 
Error rate obtained from leave-one-out cross validation (LOOCV) for predicting the disease 
outcome as a function of the number of reporter genes used in the classifier. (B) Scatter plot 
between correlation to good group (X axis) and to poor group (Y axis). Circles indicate 
metastases free samples, squares indicate samples with metastases. Dashed line: threshold 
for separating poor from good. (C) Error rate calculated with respect to good outcome group 
(good outcome misclassified as poor divided by total number of good), or poor outcome 
group (poor outcome misclassified as good divided by total number of poor), or the average 
of the two rates. 

[0061] FIG. 7. Performance of classifier for the "ER~~, BRCA1" group. (A) Error rate 
obtained from leave-one-out cross validation (LOOCV) for predicting the disease outcome as 
a function of the number of reporter genes used in the classifier. (B) Scatter plot between 
correlation to good group (X axis) and to poor group (Y axis). Circles indicate metastases 
free samples, squares indicate samples with metastases. Dashed line: threshold for separating 
poor from good. (C) Error rate calculated with respect to good outcome group (good 
outcome misclassified as poor divided by total number of good), or poor outcome group 
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(poor outcome misclassified as good divided by total number of poor), or the average of the 
two rates. 

[0062] FIG. 8. Heatmaps of genes representing key biological functions in subgroups of 
patients: A: Cell cycle genes are predictive of outcome in patients with ER/age high. B: Cell 
cycle genes are not predictive of outcome in "ER- and sporadic" patients C: Glycolysis genes 
are predictive of outcome in patients with ER/age low and LN-. D: Glycolysis genes are not 
predictive of outcome in 'ER- & BRCA1" patients. 

5 . DETAILED DESCRIPTION OF THE INVENTION 

5.1 INTRODUCTION 
[0063] The present invention provides methods for classifying individuals having a 
condition, such as a disease, into one or more subsets of individuals, where individuals in 
each subset are characterized by one or more phenotypic or genotypic characteristics of the 
condition. The individuals may be eukaryotes or prokaryotes, may be animals such as 
mammals, including but not limited to humans, primates, rodents, felines, canines, etc., birds, 
reptiles, fish, etc. "Individuals" as used herein also encompasses single-celled organisms, or 
colonies thereof, such as bacteria and yeast. The condition may be a disease, such as cancer, 
and may be a specific cancer, such as breast cancer. The condition may also be an 
environmental condition, such as exposure to a toxin, pollutant, drug, proximity to urban or 
industrial areas, etc. 

[0064] The present invention provides methods of determining the prognosis of individuals 
having a condition, such as cancer, for example, breast cancer, or who are suspected of 
having the condition, by the use of a combination of clinical, biological or biochemical 
parameters of the condition and gene expression pattern data. For prognosis, the parameters 
selected preferably relate to or affect the progression and/or outcome of the condition. The 
pattern of gene expression within a subset of individuals having the particular condition leads 
to the identification of sets of genes within a subset that is informative for that subset, for 
example, for prognosis within that subset. In general, the successful identification of sets of 
genes informative for prognosis within a particular subset justifies the selection of the 
plurality of clinical, biological or biochemical parameters of the condition on which division 
of individuals into condition subsets is based. 

[0065] In the example of breast cancer, patient groups are first classified according to at 
least one of age, lymph node (LN) status, estrogen receptor (ER) level, and BRCA1 mutation 
status into discrete patient subsets. These clinical factors have been implicated in tumor 
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etiology as well as differences in disease outcome. These characteristics are not limiting; 
other genotypic or phenotypic characteristics of breast cancer, for example, tumor grade, 
tumor size, tumor cell type, etc., may also be used, alone or in combination with those listed 
herein, in order to classify individuals. The differences in gene expression or in tumor fate 
related to these parameters likely represent differences in tumor origin aixd tumor genesis, and 
are therefore good candidates for tumor stratification. Genesets informative for prognosis 
within each subset are then identified. New breast cancer patients are then classified using 
the same criteria, and a prognosis is made based on the geneset specific for the patient subset 
into which the patient falls. In the process of constructing a prognosis classifier within each 
patient subset, particular attention is paid to the homogeneous patterns related to the tumor 
outcome. Emergence of such homogeneous prognosis patterns may indicate the most 
common mechanism to metastasis within a subset. At the same time, successful 
identification of such patterns also justifies the parameters being used for the tumor 
stratification. To differentiate this approach from an mRNA-alone approach, the current 
approach of integrating clinical data with the gene expression data is referred to herein as a 
"comprehensive prognosis". 

5.2 DEFINITIONS 
[0066] As used herein, "BRCA1 tumor" or "BRCA1 type" means a tumor having cells 
containing a mutation of the BRCA1 locus. 

[0067] The "absolute amplitude" of correlation means the absolute value of the correlation; 
e.g., both correlation coefficients -0.35 and 0.35 have an absolute amplitude of 0.35. 
[0068] "Marker" means a cellular constituent, or a modification of a cellular constituent 
(e.g., an entire gene, EST derived from that gene, a protein encoded by tlxat gene, post- 
radiational modification of the protein, etc.) the expression or level of which changes 
between certain conditions. Where a change in a characteristic of the constituent correlates 
with a certain condition, the constituent is a marker for that condition. 

[0069] "Marker-derived polynucleotides" means the RNA transcribed from a marker gene, 
any cDNA or cRNA produced therefrom, and any nucleic acid derived therefrom, such as 
synthetic nucleic acid having a sequence derived from the gene corresponding to the marker 
gene. 

[0070] A "similarity value" is a number that represents the degree of similarity between two 
things being compared. For example, a similarity value may be a number- that indicates the 
overall similarity between a patient's expression profile of specific phenotype-related 
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markers and a template specific to that phenotype (for instance, the similarity to a "good 
prognosis" template, where the phenotype is a good prognosis). The similarity value may be 
expressed as a similarity metric, such as a correlation coefficient, or may simply be expressed 
as the expression level difference, or the aggregate of the expression level differences, 
between a patient sample and a template. 

[0071] A "patient subset" is a group of individuals, all of whom liave a particular condition, 
or are subject to a particular condition, which is distinguished from other individuals having 
that condition by one or more phenotypic, genotypic or clinical characteristics of the 
condition, or of a response to the condition. For example, where tlie condition is breast 
cancer, individuals may belong to an "ER + " or an "ERT" patient smbset, or may belong to a 
particular age group patient subset. 

[0072] A gene and/or marker is "informative" for a condition, plienotype, genotype or 
clinical characteristic if the expression of the gene or marker is correlated or anticorrelated 
with the condition, phenotype, genotype or clinical characteristic to a greater degree than 
would be expected by chance. 

[0073] An individual of a given age can be classified as "ER/AGE high" if the individual's 
ER level is higher than a threshold value for the given age. The threshold can be age- 
dependent, i.e., a different threshold for each different age. In one embodiment, the age- 
dependent threshold value is calculated as c • (AGE - d), where c is a coefficient, AGE is the 
age of the patient, and d is an age threshold. The parameters c and d depend on the ER level 
and AGE used. They can be determined by fitting patients' ER le%/el-age distribution to a 
bimodal distribution of two subgroups each having a different ER level-age dependence. In a 
specific embodiment, c = 0.1 and d= 42.5 is used for ER levels represented by a log(ratio) of 
ER expression level. Thus, for example, the threshold for a 45-year old individual in this 
embodiment is 0.1 (45-42.5), or 0.25, and if the log(ratio) of ER expression level of the 
individual is equal to or greater than 0.25, the individual is classified as "ER/AGE high"; 
otherwise, the individual is classified as "ER/AGE low." 

5.3 IDENTIFICATION OF DIAGNOSTIC AND PROGNOSTIC MARKER 
SETS 

5.3.1 IDENTIFICATION OF CONDITION SUBSETS 
[0074] The present invention provides methods of identifying sets of genes and/or markers 
useful in the diagnosis and prognosis of breast cancer. More generally, the invention also 
provides methods of identifying sets of genes and/or markers usefial in the diagnosis or 
prognosis of other cancers, and even more generally, of identify ing sets of genes and/or 
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markers useful in the differentiation between subgroups of individuals having a particular 
condition, such as a disease or exposure to a particular environmental condition. 
[0075] The method may be applied to any condition for which a plurality of phenotypic or 
genotypic subsets may be identified. The condition may be a disease; for example, iihe 
condition may be cancer, an autoimmune disease, an inflammatory disease, an infectious 
disease, a neurological disease, a degenerative disease, etc. The condition may be 
environmental; for example, the condition may be a particular diet, geographic location, etc.; 
the condition may be exposure to a compound, including, for example, a drug, a to>cin, a 
carcinogen, a foodstuff, a poison, an inhaled compound, an ingested compound, etc_; the 
condition may be a particular genetic background or predisposition to a medical condition; 
etc. 

[0076] Where the condition is cancer, the condition may be any cancer, for example, 
without limitation: leukemias, including acute leukemia, acute lymphocytic leukemia, acute 
myelocytic leukemia, myeloblasts leukemia, promyelocyte leukemia, myelomonocytic 
leukemia, monocytic leukemia, and erythroleukemia; chronic leukemia, such as chronic 
myelocytic (granulocytic) leukemia or chronic lymphocytic leukemia; polycythemia vera; 
lymphomas, such as Hodgkin's disease and non-Hodgkin's disease; multiple myelojma; 
Waldenstrom's macroglobulinemia; heavy chain disease; solid tumors, such as sarcomas and 
carcinomas, fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, 
chordoma, angiosarcoma, endotheliosarcoma, lymphangiosarcoma, 

lymphangioendotheliosarcoma, synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, 
rhabdomyosarcoma, colon carcinoma, pancreatic cancer, breast cancer, ovarian cancer, 
prostate cancer, squamous cell carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland 
carcinoma, sebaceous gland carcinoma, papillary carcinoma, papillary adenocarcinomas, 
cystadenocarcinoma, medullary carcinoma, bronchogenic carcinoma, renal cell carcinoma, 
hepatoma, bile duct carcinoma, choriocarcinoma, seminoma, embryonal carcinoma., Wilms' 
tumor, cervical cancer, testicular tumor, lung carcinoma, small cell lung carcinoma, bladder 
carcinoma, epithelial carcinoma, glioma, astrocytoma, medulloblastoma, craniopharyngioma, 
ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, oligodendroglioma, 
menangioma, melanoma, neuroblastoma, or retinoblastoma; etc. 

[0077] Rather than stratifying individuals, such as patients or tumor samples derived from 
patients, by gene expression patterns in the first instance, the method of identifying sets of 
genes informative for a condition begins by identifying phenotypic, genotypic or clLnical 
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subsets of individuals within the larger class of individuals having or affected by the 
condition. 

[0078] In one embodiment, the condition is cancer, and the subsets are distinguished by 
phenotypic, genotypic, and/or clinical characteristics of the cancer. In this embodiment, 
groups of individuals are classified according to one or more phenotypic, genotypic, or 
clinical characteristics relevant to the cancer into patient subsets. At any step in the process 
of subdividing a patient population into patient subsets, the expression level of one or more 
genes may be determined in order to identify whether a prognosis-informative set of genes 
may be identified for the particular patient subset. If an informative gene set is identified, but 
is not as informative as desired, the patient subset may be further divided and a new geneset 
identified. These subsets may be further subdivided. For example, a group of individuals 
affected by a particular cancer may be classified first on the basis of a phenotypic, genotypic 
or clinical characteristic A into subsets SI and S2. The levels of expression of a plurality of 
genes are then determined in tumor samples taken from individuals that fall within subsets SI 
or S2 in order to identify sets of genes informative for prognosis within these subsets. 
Subsets SI and S2 may then each be subdivided into two or more subsets based on other 
phenotypic, genotypic or clinical characteristics. The basis for subdivision, if performed, 
need not be the same for SI and S2. For example, in various embodiments, SI is not 
subdivided, while S2 is subdivided on the basis of characteristic B; or SI is subdivided based 
on characteristic B while S2 is not subdivided; or SI and S2 are both subdivided on the basis 
of characteristic B; or SI is subdivided based on characteristic B, while S2 is subdivided 
according to characteristic C; and so on. For a particular decision matrix leading to a 
plurality of patient subsets, the preferred outcome is a prognosis-informative set of genes for 
each patient subset. Different decision matrices may lead to different patient subsets, which, 
in turn, may result in different sets of prognosis-informative genes. 
[0079] In the specific example of breast cancer, a plurality of phenotypic, genotypic or 
clinical indications are used to classify a patient as being a member of one of a plurality of 
patient subsets, wherein the indications are medically, biochemically or genetically relevant 
to breast cancer. For example, a group of patients may be classified into patient subsets 
based on criteria including, but not limited to, estrogen receptor (ER) status, type of tumor 
(i.e., BRCA1 -type or sporadic), lymph node status, grade of cancer, invasiveness of the tumor, 
or age. "BRCAl-type" indicates that the BRCA1 mutation is present. In each classification 
step, a group of cancer patients may be classified into only two classes, for example, ER-1- or 
ER", or into three or more subsets (for example, by tumor grade), depending upon the 
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characteristic used to determine the subsets. As used herein, "ER+" indicates that the 
estrogen receptor is expressed at some elevated level; for example, it may indicate that the 
estrogen receptor is detectably expressed, or may indicate that more than 10% of cells are 
histologically stained for the receptor, etc. Conversely, "ER— " indicates that the estrogen 
receptor is expressed at a reduced level or not at all; for example, it may indicate that the 
receptor is not detectably expressed, or that 10% or less of cells are histologically stained for 
the receptor, etc. Marker gene sets optimized for each plrenotypic class are preferably 
determined after the subsets are established. Where informative markers for a particular 
patient subset, distinguished from another subset by a particular characteristic of the 
condition of interest, cannot be determined, the subset may be further divided by another 
characteristic of the condition to create a plurality of second patient subsets, whereupon genes 
informative for these second patient subsets may be identified. 

[0080] FIG. 1 depicts the process, described in the Examples, of subdivision of a collection 
of breast cancer patients according to phenotypic and genotypic characteristics relevant to 
breast cancer, in preparation for identification of genes informative for prognosis. A 
collection of breast cancer tumor samples was first subdivided by estrogen receptor status. 
ER status was chosen because the presence or absence of the estrogen receptor greatly 
influences the expression of other genes. In the ER+ patient subset, it was noted that patients 
appeared to be bimodally distributed by ER level vs. age; that is, ER level dependence upon 
age tended to fall within two classes, as separated by the solid line in FIG. 2A. This 
bimodality was used to further subdivide ER+ individuals into "ER+, ER/AGE high" 
individuals and "ER+, ER/AGE low" individuals. A set of informative genes was identified 
for the ER+, ER/AGE high patient subset. An informative set was not identified for the ER+, 
ER/AGE low subset, however, so the subset of patients Avas further divided into LN+ and 
LN— individuals. Thus, in one embodiment, the present invention provides a method of 
identifying a set of informative genes or markers for a condition comprising a plurality of 
phenotypic or genotypic characteristics, comprising (a) classifying each of a plurality of 
samples or individuals on the basis of one phenotypic or genotypic characteristic into a 
plurality of first classes; and (b) identifying within each of said first classes a set of 
informative genes or markers, wherein said set of informative genes or markers within each 
said first classes is unique to said class. 

5.3.2 IDENTIFICATION OF MARKER SETS INFORMATIVE FOR 
PATIENT SUBSETS 
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[0081] Once a patient subset is identified, markers, such as genes, informative for a 
particular outcome, such as prognosis, may be identified. In one embodiment, the method for 
identifying marker sets is as follows. This example describes the use of genes and gene- 
derived nucleic acids as markers; however, proteins or othter cellular constituents may be used 
as markers of the condition. 

[0082] After extraction and labeling of target polynucleotides, the expression of a plurality 
of markers, such as genes, in a sample X is compared to tbte expression of the plurality 
markers in a standard or control. In one embodiment, the standard or control comprises 
target markers, such as polynucleotide molecules, derived, from one or more samples from a 
plurality of normal individuals, or a plurality of individuals not exposed to a particular 
condition. For example, the control, or normal, individuals may be persons without the 
particular disease or condition of interest (e.g., individuals not afflicted with breast cancer, 
where breast cancer is the disease of interest), or may be an individual not exposed to a 
particular environmental condition. The standard or control may also comprise target 
polynucleotide molecules, derived from one or more samples derived from individuals having 
a different form or stage of the same disease; a different disease or different condition, or 
individuals exposed or subjected to a different condition, than the individual from which 
sample X was obtained. The control may be a sample, or- set of samples, taken from the 
individual at an earlier time, for example, to assess the progression of a condition, or the 
response to a course of therapy. 

[0083] In a preferred embodiment, the standard or control is a pool of target polynucleotide 
molecules. However, where protein levels, or the levels of any other relevant biomolecule, 
are to be compared, the pool may be a pool of proteins ox the relevant biomolecule. In a 
preferred embodiment in the context of breast cancer, the pool comprises samples taken from 
a number of individuals having sporadic-type tumors. 

[0084] In another preferred embodiment, the pool comprises an artificially-generated 
population of nucleic acids designed to approximate the level of nucleic acid derived from 
each marker found in a pool of marker-derived nucleic acids derived from tumor samples. In 
another embodiment, the pool, also called a "mathematical sample pool," is represented by a 
set of expression values, rather than a set of physical polynucleotides; the level of expression 
of relevant markers in a sample from an individual with a condition, such as a disease, is 
compared to values representing control levels of expression for the same markers in the 
mathematical sample pool. Such a control may be a set of values stored on a computer. Such 
artificial or mathematical controls may be constructed for any condition of interest. 
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[0085] In another embodiment specific to breast cancer, the pool is derived from normal or 
breast cancer cell lines or cell line samples. In a preferred embodiment, the pool comprises 
samples taken from individuals within a specific patient subset, e.g., "ER+, ER/AGE high" 
individuals, wherein each of said individuals has a good prognosis, or each of said individuals 
has a poor prognosis. Of course, where, for example, expressed proteins are used as markers, 
the proteins are obtained from the individual's sample, and the standard or control could be a 
pool of proteins from a number of normal individuals, or from a number of individuals 
having a particular state of a condition, such as a pool of samples from individuals having a 
particular prognosis of breast cancer. 

[0086] The comparison may be accomplished by any means known in the art. For example, 
expression levels of various markers may be assessed by separation of target polynucleotide 
molecules (e.g., RNA or cDNA) derived from the markers in agarose or polyacrylamide gels, 
followed by hybridization with marker-specific oligonucleotide probes. Alternatively, the 
comparison may be accomplished by the labeling of target polynucleotide molecules 
followed by separation on a sequencing gel. Polynucleotide samples are placed on the gel 
such that patient and control or standard polynucleotides are in adjacent lanes. Comparison 
of expression levels is accomplished visually or by means of densitometer. In a preferred 
embodiment, the expression of all markers is assessed simultaneously by hybridization to a 
microarray. In each approach, markers meeting certain criteria are identified as informative 
for the prognosis of breast cancer. 

[0087] Marker genes are selected based upon significant difference of expression in a 
condition, such as a disease, as compared to a standard or control condition. Marker genes 
may be screened, for example, by determining whether they show significant variation within 
a set of samples of interest. Genes that do not show a significant amount of variation within 
the set of samples are presumed not to be informative fox the disease or condition, and are not 
selected as markers for the disease or condition. Genes showing significant variation within 
the sample set are candidate informative genes for the disease or condition. The degree of 
variation may be estimated by calculating the difference of the expression of the gene, or 
ratio of expression between sample and control, within the set of samples. The expression, or 
ratio of expressions, may be transformed by any means, e.g., linear or log transformation. 
Selection may be made based upon either significant up- or down regulation of the marker in 
the patient sample. Selection may also be made by calculation of the statistical significance 
(i.e., the p-value) of the correlation between the expression of the marker and the disease and 
condition. Preferably, both selection criteria are used. Thus, in one embodiment of the 
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present invention, markers associated with prognosis of* breast cancer within a patient subset 
are selected where the markers show both more than twx>-fold change (increase or decrease) 
in expression as compared to a standard, and the p-valae for the correlation between the 
existence of breast cancer and the change in marker expression is no more than 0.01 (i.e., is 
statistically significant). 

[0088] In the context of the present invention, "good prognosis" indicates a desired outcome 
for a particular condition, especially a particular disease, and "poor prognosis" indicates an 
undesired outcome of the condition. For example, where the condition is cancer, a "good 
prognosis" may mean partial or complete remission, and "poor prognosis" may mean 
reappearance of the cancer after treatment. What constitutes "good prognosis" and "poor 
prognosis" is specific to the condition of interest, for example, specific to the particular 
cancer an individual suffers. For example, "good prognosis" for pancreatic cancer may be 
survival for one or two years after initial diagnosis, while "good prognosis" for Hodgkin's 
disease may be survival for five years or more. In the specific example of breast cancer, 
"good prognosis" means the likelihood of non-reoccurxence of metastases within a period of 
1, 2, 3, 4, 5 or more years after initial diagnosis, and "poor prognosis" means the likelihood 
of reoccurrence of metastasis within that period. In a more specific example, "good 
prognosis" means the likelihood of non-reoccurrence of metastases within 5 years after initial 
diagnosis, and "poor prognosis" means the likelihood of reoccurrence of metastasis within 
that period. 

[0089] In a more specific embodiment for cancer, for* example, breast cancer, using a 
number of breast cancer tumor samples, markers are identified by calculation of correlation 
coefficients p between the clinical category or clinical parameter(s) c and the linear, 
logarithmic or any transform of the expression ratio r across all samples for each individual 
gene. Specifically, the correlation coefficient may be calculated as: 
[0090] p = (c • r - \\r ||) Equation (1) 

[0091] Markers for which the coefficient of correlation exceeds a cutoff are identified as 
prognosis-informative markers specific for a particular clinical type, e.g., good prognosis, 
within a given patient subset. Such a cutoff or threshold may correspond to a certain 
significance of discriminating genes obtained by Monte Carlo simulations. The threshold 
depends upon the number of samples used; the threskold can be calculated as 3 X 1 / Vn — 3 , 
where 1 / is the distribution width and n = the number of samples. In a specific 
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embodiment, markers are chosen if the correlation coefficient is greater than about 0.3 or less 
than about -0.3. 

[0092] Next, the significance of the correlation is calculated. This significance may be 
calculated by any statistical means by which such significance is calculated. In a specific 
example, a set of correlation data is generated using a Monte-Carlo technique to randomize 
the association between the expression difference of a particular marker and the clinical 
category. The frequency distribution of markers satisfying the criteria in the Monte-Caorlo 
runs is used to determine whether the number of markers selected by correlation with clinical 
data is significant. 

[0093] Once a marker set is identified, the markers may be rank-ordered in order of 
significance of discrimination. One means of rank ordering is by the amplitude of correlation 
between the change in gene expression of the marker and the specific condition being 
discriminated. Another, preferred, means is to use a statistical metric. In a specific 
embodiment, the metric is a t-test-like statistic: 

[0094] tJ&yMVfi— r Equation^) 

/VPl <"1 - 1 )+^(^2 - W*l + "2 -0/(1/*! + 0f 2 ) 

[0095] In this equation, (x l ) is the error- weighted average of the log ratio of transcript 
expression measurements within a first clinical group (e.g., good prognosis), (* 2 )is th_<? error- 
weighted average of log ratio within a second, related clinical group (e.g., poor prognosis), 
a x is the variance of the log ratio within the first clinical group (e.g., good prognosis), ri\ is 
the number of samples for which valid measurements of log ratios are available, <x 2 is the 
variance of log ratio within the second clinical group (e.g., poor prognosis), and n 2 is the 
number of samples for which valid measurements of log ratios are available. The f-vaJue 
represents the variance-compensated difference between two means. 

[0096] The rank-ordered marker set may be used to optimize the number of markers in the 
set used for discrimination. This is accomplished generally in a "leave one ouf 9 method as 
follows. In a first run, a subset, for example five, of the markers from the top of the ranked 
list is used to generate a template, where out of X samples, X-l are used to generate ttie 
template, and the status of the remaining sample is predicted. This process is repeated for 
every sample until every one of the X samples is predicted once. In a second run, additional 
markers, for example five additional markers, are added, so that a template is now generated 
from 10 markers, and the outcome of the remaining sample is predicted. This process is 
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repeated until the entire set of markers is used to generate the template. For each of the runs, 
type 1 error (false negative) and type 2 errors (false positive) are counted; the optimal number 
of markers is that number where the type 1 error rate, or type 2 error rate, or preferably the 
total of type 1 and type 2 error rate is lowest. 

[0097] For prognostic markers, validation of the marker set may be accomplished Tsy an 
additional statistic, a survival model. This statistic generates the probability of tumor distant 
metastases as a function of time since initial diagnosis. A number of models may be used, 
including Weibull, normal, log-normal, log logistic, log-exponential, or log-Rayleigh 
(Chapter 12 "Life Testing", S-PLUS 2000 Guide to Statistics, Vol. 2, p. 368 (2000)). For 
the "normal" model, the probability of distant metastases P at time t is calculated as 
[0098] P = axexp(-r 2 /r 2 ) Equation 

(3) 

[0099] where a is fixed and equal to 1 , and T is a parameter to be fitted and measures the 
"expected lifetime". 

[00100] It is preferable that the above marker identification process be iterated one or more 
times by excluding one or more samples from the marker selection or ranking {i.e., from the 
calculation of correlation). Those samples being excluded are the ones that can not be 
predicted correctly from the previous iteration. Preferably, those samples excluded, from 
marker selection in this iteration process are included in the classifier performance 
evaluation, to avoid overstating the performance. 

[00101] It will be apparent to those skilled in the art that the above methods, in particular the 
statistical methods described above, are not limited to the identification of markers associated 
with the prognosis of breast cancer within a particular patient subset, but may be used to 
identify set of marker genes associated with any phenotype or condition, or with amy subset 
of a phenotype or condition defined by one or more characteristics of the phenotype or 
condition. The phenotype or condition can be the presence or absence of a disease such as 
cancer, or the presence or absence of any identifying clinical condition associated with that 
cancer. In the disease context, the phenotype may be a prognosis such as a survival time, 
probability of distant metastases of a disease condition, or likelihood of a particular response 
to a therapeutic or prophylactic regimen. The phenotype need not be cancer, or a disease; the 
phenotype may be a nominal characteristic associated with a healthy individual. 
[00102] Thus, the invention provides a method of identifying a set of informative genes or 
markers for a condition comprising a plurality of phenotypic of genotypic characteristics, 
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comprising: (a) classifying each of a plurality of samples or individuals on the basis of one or 
more phenotypic or genotypic characteristics of said condition into a plurality of first classes; 
(b) identifying within each of said first classes a first set of genes or markers informative for 
said condition, wherein said first set of genes or markers within each of said first classes is 
unique to said class relative to other classes. In a specific embodiment, samples or 
individuals in at least one of said first classes are additionally classified on the basis of a_ 
phenotypic or genotypic characteristic different from that used to distinguish said first classes 
into a plurality of second classes, and identifying within at least one of said second classes a 
second set of informative genes or markers, wherein said second set of informative genes or 
markers within each of said second classes is unique to said second class relative to other 
classes. In another embodiment, the invention provides a method of identifying a set of 
informative genes or markers for a condition comprising a plurality of phenotypic or 
genotypic characteristics, comprising: (a) classifying each of a plurality of samples or 
individuals on the basis of one or more phenotypic or genotypic characteristics into a 
plurality of first classes; (b) classifying at least one of said first classes into a plurality of 
second classes on the basis of phenotypic or genotypic characteristic different than that used 
to distinguish said plurality of first classes; (c) identifying within at least one of said first 
classes or said second classes a set of genes or markers informative for said condition, 
wherein said set of genes or markers is unique to said class relative to other classes. Tire 
invention further provides a method of identifying a set of informative genes or markers for a 
condition comprising a plurality of phenotypic or genotypic characteristics, comprising z (a) 
selecting a first characteristic from said plurality of phenotypic or genotypic characteristics; 
(b) identifying at least two first condition classes differentiable by said first characteristic; (c) 
selecting a plurality of individuals classifiable into at least one of said first condition classes; 
and (d) identifying in samples derived from each of said plurality of individuals a set of genes 
or markers informative for said condition within said at least one of said first condition 
classes. 

5.3.3 CLASSIFIER GENESETS FOR FIVE PATIENT SUBSETS 
[00103] The present invention provides sets of markers useful for the prognosis of breast 
cancer. The markers were identified according to the above methods in specific subsets of 
individuals with breast cancer. Generally, the marker sets were identified within a population 
of breast cancer patients that had been first stratified into five phenotypic categories based on 
criteria relevant to breast cancer prognosis, including estrogen receptor (ER) status, lyrxiph 
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node status, type of mutation(s) (i.e., BRCAl-type or sporadic) and age at diagnosis. More 
specifically, patients, and tumors from which samples were taken, were classified as ER~ 
sporadic {i.e., being both estrogen receptor negative and having a non-BRCAl -type tumor); 
ER^, BRCA1 (i.e., being both estrogen receptor negative and having a BRCAl-type tumor); 
ER+, ER/AGE high (i.e., estrogen receptor positive with a high ratio of the log (ratio) of 
estrogen receptor gene expression to age); ER+, ER/AGE low, LN+ (i.e., estrogen receptor 
positive with a low ratio of the log (ratio) of estrogen receptor gene expression to age, lymph 
node positive); and ER + , ER/AGE low, LN~~ (i.e., estrogen receptor positive with a low ratio 
of the log (ratio) of estrogen receptor gene expression to age, lymph node negative). The 
rationale for subdivision of the original patient set into these five subsets is detailed in the 
Examples (Section 6). The marker sets useful for each of the subsets above are provided in 
Tables 1-5, respectively. 



Table 1: Geneset of 20 markers used to classify ER , sporadic individuals. 



Accession/ 
Contig No. 


Gene 


Avg 

good 

xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp_xrefJkeywordJist 


SEQ ID 


AF055033 


IGFBP5 


-2,12 


0.88 


0.54 


insulin-like 
growth factor 
binding protein 5 


Growth factor binding, 
Glycoprotein, Signal, 3D- 
structure 


11 


NM_000599 


IGFBP5 


-3.41 


0.43 


0.53 


insuiin-like 
growth factor 
binding protein 5 


Growth factor binding, 
Glycoprotein, Signal, 3D- 
structure 


51 


L27560 


IGFBP5 


-4.55 


0 


0.52 


EST 


Hypothetical protein 


29 


AF052162 


FLJ 12443 


-0.27 


1.6 


0.52 


EST 


Hypothetical protein 


9 


NM_001456 


FLNA 


-0.61 


2.47 


0.52 


filamin A, alpha 
(actin binding 
protein 280) 


Hypothetical protein, 
Actin-binding, 
Phosphorylation, Repeat, 
Polymorphism, Disease 
mutation 


, 73 


NM_002205 


ITGA5 


-0.37 


2.08 


0.49 


integrin, alpha 5 
(fibronectin 
receptor, alpha 
polypeptide) 


integrin, Cell adhesion, 
Receptor, Glycoprotein, 
Transmembrane, Signal, 
Calcium, Repeat 


93 


NM_0 13261 


PPARGC1 


0.09 


1.54 


0.47 


peroxisome proliferative activated receptor, 
gamma, coactivator 1 


231 


NM_001605 


AARS 


0.39 


2.36 


0.51 


alanyl-tRNA 
synthetase 


Aminoacyl-tRNA 
synthetase, Protein 
biosynthesis, Ligase, 
ATP-binding 


77 


X87949 


HSPA5 


-0.03 


2.03 


0.49 


heat shock 
70kDa protein 5 
(glucose- 
regulated 
protein, 78kDa) 


ATP-binding, Hypothetical 
protein, Endoplasmic 
reticulum, Signal 


273 


Contig50950_RC 


NGEF 


-1.17 


3.2 


0.52 


neuronal guanine nucleotide exchange 
factor 


337 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp xref_keyword_list 


SEQ ID 


NM_005689 


ABCB6 


-0.51 


2.26 


0.48 


ATP-binding 
cassette, sub- 
family B 
(MDR/TAP), 
member 6 


ATP-binding, Transport, 
Transmembrane, 
Mitochondrion, Inner 
membrane, Transit 
peptide, Hypothetical 
protein 


187 


NM_004577 


PSPH 


-0.56 


3.05 


0.51 


phosphoserine 
phosphatase 


Hydrolase, Serine 
biosynthesis, Magnesium, 
Phosphorylation 


151 


NM_003832 


PSPHL 


-2.08 


2.18 


0.5 


phosphoserine 

phosphatase- 

like 




131 


NM_002422 


MMP3 


-0.96 


2.54 


0.5 


matrix 

metalloproteinas 
e 3 (stromelysin 
1 , progelatinase) 


Hydrolase, 
Metalloprotease, 
Glycoprotein, Zinc, 
Zymogen, Calcium, 
Collagen degradation, 
Extracellular matrix, 
Signal, Polymorphism, 
ou-siruciure 


101 


Contia37 r >62 RH 




-3 4? 




-0 *iQ 

\J.\J\3 


1 — O 1 O 






NM_018465 


MDS030 


-0.82 


-3.28 


-0.58 


uncharacterized 

nemaiopoietic 

stem/progenitor 

L^tJIlo fJIUlcMM 

MDS030 


Hypothetical protein 


267 


Contig54661_RC 




-0.79 


-2.08 


-0.54 


ESTs 




349 


AB032969 


KIAA1143 


-0.6 


-2.85 


-0.53 


K1AA1143 
protein 


Hypothetical protein 


1 


Contig55353_RC 


KIAA1915 


-0.27 


-1.82 


-0.47 


KIAA1915 
protein 


Hypothetical protein 


353 


NM_005213 


CSTA 


2.11 


-3.4 


-0.49 


cystatin A (stefin 
A) 


Thiol protease inhibitor, 
3D-structure 


175 



Table 2. Geneset of 10 markers used to classify ER , BRCA1 individuals. 



Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp xref keywo 
rdjist 


SEQ ID 


AF005487 




6.08 


0.5 


-0.79 


HLA-DRB6 


Homo sapiens 
MHC class II 
antigen (DRB6) 
mRNA, HLA- 
DRB6*0201 allele, 
sequence. 


MHC 


3 


Contig50728_RC 
Contig53598_RC 




4.02 


0.25 


-0.77 




ESTs, Weakly similar to S26650 
DNA-binding protein 5 - human 
[H. sapiens] 


333 


8.41 


3.26 


-0.77 


FLJ11413 


hypothetical protein 
FLJ11413 


Hypothetical 
protein 


343 


NMJ302888 


RARR 
ES1 


6.9 


0.05 


-0.87 


RARRES1 


retinoic acid 
receptor respo rider 
(tazarotene 
induced) 1 


Receptor, 
Transmembrane 
, Signal-anchor 


109 
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NM_005218 


DEFB1 


5.14 


-3.02 


-0.81 


DEFB1 


defensin, beta 1 


Antibiotic, 
Signal, 3D- 
structure 


177 


U 17077 


BENE 


2.72 


-1.72 


-0.77 


BENE 


BENE protein 


Transmembrane 


271 


Contig14683_RC 




1.29 


-2.31 


-0.74 




ESTs 




279 


Contig53641_RC 




-3.29 


4.23 


0.75 


MAGE-E1 


MAGE-E1 protein 


Hypothetical 
protein 


345 


Contig56678_RC 




-6.7 


-9.73 


-0.82 




ESTs, Highly similar to 
THYAJHUMAN Rrothymosin alpha ! 
[H. sapiens] 


357 


NM_005461 


KRML 


0.88 


-3.38 


-0.75 


MAFB 


v-maf 

musculoaponeuroti 
c fibrosarcoma 
oncogene homolog 
B (avian) 


Transcription 

regulation, 

Repressor, 

DNA-binding, 

Nuclear protein, 

Hypothetical 

protein 


181 



Table 3. Geneset of 50 markers used to classify ER+ 5 ER/AGE high individuals. 



Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp xref keyword 
Jist 


SEQ ED 


NM_003600 


STK15 


-2.93 


2.08 


0.8 


serine/threonine 
kinase 6 


^TP-binding, 
Kinase, 

Serine/threonine- 
protein kinase, 
Transferase 


125 


NM_003158 


STK6 


-1.57 


1.42 


0.78 


serine/threonine 
kinase 6 


yVTP-binding, 
Kinase, 

Serine/threonine- 
protein kinase, 
Transferase 


113 


NMJ307019 


UBCH10 


-2.98 


2.62 


0.81 


ubiquitin-conjugating 
enzyme E2C 


Hypothetical 
protein, UbI 
conjugation 
pathway, Ligase, 
Multigene family, 
Mitosis, Cell cycle, 
Cell division 


217 


NM_013277 


ID-GAP 


-2.43 


2.43 


0.77 


Rac GTPase 
activating protein 1 


Hypothetical 
protein 


233 


NMJD04336 


BUB1 


-2.04 


139 


0.77 


BUB1 budding 
uninhibited by 
benzimidazoles 1 
homolog (yeast) 


Transferase, 
Serine/threonine- 
protein kinase, 
ATP-binding, Cell 
cycle, Nuclear 
protein, Mitosis, 
Phosphorylation, 
Polymorphism 


147 
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> Accession/ 
' Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp xref keyword 
_list 


SEQ ID 


NM_006607 


PTTG2 


-1.71 


1.49 


0.72 


pituitary tumor- 
transforming 2 




211 


AK001166 


FLJ 11252 


-1.33 


0.99 


0.71 


hypothetical protein 
FLJ 11252 


Hypothetical 
protein 


13 


NM_004701 


CCNB2 


-4.62 


2.01 


0.81 


cyclin B2 


Cyclin, Cell cycle, 
Cell division, 
Mitosis 


153 


Contig57584_RC 




-3.68 


2.04 


0.78 


likely ortholog of mouse gene rich 
cluster, C8 gene 


359 


NM_006845 


KNSL6 


-4.13 


1.05 


0.73 


Klnesm-llKe o ^miiouc 
centromere- 
associated kinesin) 


nypoineucai 
protein, Motor 
protein, 

Microtubules, ATP- 

KinHinn (~* r~\\ loH r*oi I 
uniuiiiy, ouiicu ouu, 

Nuclear protein 


215 


Contig38901_RC 




-3.08 


1.15 


0.75 


hypothetical protein 

1 V I VJ3 0*+ O O DO 


Hypothetical 

pi Ulcll 1 


299 


NM_018410 


DKFZp76 
2E1312 


-4.38 


1.49 


0.75 


hypothetical protein 
DKFZp762E1312 


Hypothetical 
protein 


263 


NM_003981 


PRC1 


-3.52 


2.17 


0.78 


protein regulator of 
cytokinesis 1 




133 


NM_001809 


CENPA 


-5.04 


0.98 


0.75 


centromere protein 
A, l7kDa 


Hypothetical 
protein, 
Chromosomal 
protein, Nuclear 
protein, lmnm- 
binding, 
Centromere, 
Antigen 


81 


NM_003504 


CDC45L 


-2.67 


1.22 


0.73 


CDC45 cell division 
cerevisiae) 


DNA replication, 

Otrll Uyoio, ivuOlccH 

protein, Cell 
division 


123 


Contig41413_RC 




-5.43 


2.15 


0.74 


ribonucleotide 
reductase M2 
polypeptide 


Oxidoreductase, 
DNA replication, 
Iron 


305 


NM_004217 


STK12 


-2.17 


0.73 


0.72 


serine/threonine 
kinase 12 


Hypothetical 
protein, ATP- 
binding, Kinase, 
Serine/threonine- 
protein kinase, 
Transferase 


143 


NM_002358 


MAD2L1 


-2.65 


2.27 


0.83 


MAD2 mitotic arrest 
deficient-like 1 
(yeast) 


Cell cycle, Mitosis, 
Nuclear protein, 
3D-structure 


99 


NM_0 14321 


ORC6L 


-2.73 


1.8 


0.75 


origin recognition 
complex, subunit 6 
homolog-like (yeast) 


Hypothetical 
protein, DNA 
replication, Nuclear 
protein, DNA- 
binding 


241 


NM_0 12291 


KIAA0165 


-1.52 


1.55 


0.71 


extra spindle poles 
like 1 (S. cerevisiae) 


Hypothetical 
protein 


229 
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Accession/ 
Contig No. 


Gene 


Avg 

good 

xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp xref keyword 
Jist 


SEQID 


NM_004203 


PKMYT1 


-3.64 


2.2 


0.7 


retinoblastoma-like 2 
(p130) 


ATP-binding, 
Kinase, 

Serine/threonine- 
f^roteln kinase, 
"Transferase, 
Transcription 
regulation, DNA- 
binding, Nuclear 
protein, Cell cycle, 
Phosphorylation, 
yAnti-oncogene 


137 


M96577 


E2F1 


-2.14 


1.42 


0.75 


E2F transcription 
factor 1 


Transcription 
regulation, 
y\ctivator, DNA- 
binding, Nuclear 
protein, 

Phosphorylation, 
Cell cycle, 
.Apoptosis, 
Polymorphism 


33 


NM_002266 


KPNA2 


-3.77 


1.78 


0.71 


karyopherin alpha 2 
(RAG cohort 1, 
importin alpha 1) 


Transport, Protein 
transport, Repeat, 
Nuclear protein, 
Polymorphism 


95 


Contig31288_RC 




-2.63 


0.7 


0.68 


ESTs, Weakly similar to hypothetical 
protein FLJ20489 [Homo sapiens] 
[H. sapiens] 


289 


NM_014501 


E2-EPF 


-1.55 


1.93 


0.7 


ubiquitin carrier 
protein 


Ubl conjugation 
pathway, Ligase, 
Multigene family 


247 


NM 001168 


BIRC5 


-5.76 


2.01 


0.78 


baculoviral IAP 
repeat-containing 5 
(survivin) 


Apoptosis, Thiol 
protease inhibitor, 
Alternative splicing, 
3D-structure, 
Hypothetical 
protein, Protease, 
Receptor 


63 


NM_003258 


TK1 


-4.57 


1.38 


0.71 


thymidine kinase 1, 
soluble 


Transferase, 
Kinase, DNA 
synthesis, ATP- 
binding 


115 


NM_001254 


CDC6 


-2.46 


0.28 


0.72 


CDC6 cell division 
cycle 6 homolog (S. 
cerevisiae) 


ATP-binding, Cell 
division 


; 67 ; 


NM_004900 


DJ742C19 
.2 


-2.96 


0.13 


0.69 


apolipoprotein B 
mRNA editing 
enzyme, catalytic 
polypeptide-like 3B 


Hydrolase 


161 


NM_004702 


CCNE2 


-3.12 


2.13 


0.81 


cyclin E2 


Cell cycle, Cell 
division, Cyclin, 
Hypothetical 
protein, 

Phosphorylation, 
Alternative splicing, 
Nuclear protein 


155 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


AL160131 




-3.07 


2.42 


0.7 


hypothetical protein 
MGC861 


Hypothetical 
protein 


21 


NM_016359 


LOC5120 
3 


-3.22 


2.61 


0.76 


nucleolar protein 
ANKT 


Hypothetical 
protein, Nuclear 

nrntf^in 


253 


NM_004856 


KNSL5 


-1.52 


1.1 


0.71 


kinesin-like 5 (mitotic 
kinesin-like protein 
1) 


Motor protein, Cell 
division, 

Microtubules, ATP- 
binding, Coiled coil, 
Mitosis, Cell cycle, 
Nuclear protein 


159 


INlVI UUUUO/ 


Rl iv yi 




U. / D 


U. f 1 


oioom syna rorne 


nyu roiase, 
Helicase, ATP- 
binding, DNA- 

protein, DNA 
replication, 
Disease mutation 


oo 


NM_0 18455 


BM039 


-2.44 


1.18 


0.7 


uncharacterized 
bone marrow protein 




265 


NM_002106 


H2AFZ 


-2.49 


1.53 


0.72 


H2A histone family, 
iiiciiiijc;! it- 


Chromosomal 

yji ulci i I, 

Nucleosome core, 
Nuclesr protein, 
DNA-fc>inding, 
Multig ene family 


91 


Contig64688 




-2.68 


3.1 


0.73 


hypothetical protein 
FLJ23468 


Hypothetical 
protei n 


365 


Contig44289_RC 




-1.65 


1.6 


0.67 


ESTs 




315 






-1 ^7 
I .o / 




u.uo 


diaphanous homolog 
3 (Drosophila) 


Hypothetical 
protei n, Coiled coil, 
Repeat, Alternative 
splicing 




Contig46218_RC 




-1.31 


1.56 


0.68 


ESTs, Weakly similar to T1 S201 
hypothetical protein C11GS.3 - 
Caenorhabditis elegans [C. elegans] 


321 


OOnug/iOc/H / r\0 




- I .o 


u.yo 


n «7 


cell division cycle 
25A 


Hypothetical 
protein, Cell 
division, Mitosis, 
Hydrolase, 
Alternative splicing, 
Multigene family, 
3D-structure 


ZOO 


NM_016095 


LOC516S 
9 


-1.4 


2.13 


0.67 


HSPC037 protein 


Hypothetical 
protein 


249 


NM_003090 


SNRPA1 


-3.26 


0.95 


0.7 


small nuclear 
ribonucleoprotein 
polypeptide A' 


Hypothetical 
protein, Nuclear 
protein, RNA- 
binding, 

Ribonucleoprotein, 
Leucine-rich 
repeat, Repeat, 
3D-structure 


111 j 
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Accession/ 
Contig No. 


Gene 


Avg 

good 

xdev 


Avg 
poor 
xdev 


Corre- 
lation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


NM_002811 


PSMD7 


-2.48 


1.89 


0.7 


proteasome 
(prosome, 
macropaln) 26S 
subunit, non- 
ATPase, 7 (Mov34 
homolog) 


Proteasome 


107 j 


Contig38288_RC 




-2.34 


0.97 


0.67 


hypothetical protein 
DKFZp762A2013 


Hypothetical 
protei n 


297 


NM_0034O6 


YWHAZ 


-1.5 


2.79 


0.68 


tyrosine 3- 
monooxygenase/tryp 
tophan 5- 
monooxygenase 
activation protein, 
zeta polypeptide 


Brain, Neurone, 
Phosphorylation, 
Acetyl ation, 
Multigene family, 
3D-structure 


121 


AL137540 


NTN4 


2.13 


-4.61 


-0.69 


netrin 4 


Hypothetical 
protein, Laminin 
EGF-like domain, 
Signal 


19 


AL04936Z 




1.9 


-3.2 


-0.68 


EST 


Transducer, 
Prenylation, 
Lipoprotein, 
Multigene family, 
Acet/lation 


15 


NMJ3134C9 


FST 


1.04 


-5.78 


-0.69 


follistatin 


Glycoprotein, 
Repeat, Signal, 
ARematlve splicing 


235 


NM_000O60 


BTD 


3.1 


-1.45 


-0.67 


biotinidase 


Hydrolase, 
Glycoprotein, 
Signal, Disease 
mutation 


37 



Table 4. Geneset of 50 markers used to classify ER+, ER/AGE low, LN+ individuals. 



Accession/ 


Gene 


Avg 


Avg 


Correl- 


Description 


Sp xref keyword 


SEQ ID 


Contig No. 




good 


poor 


ation 




Jist 








xdev 


xdev 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp xref keyword 
J 1st 


SEQ ID 


NMJD06417 


MTAP44 


-1.5 


3 


0.69 


Fc fragment of IgG, 
low affinity Mb, 
receptor for (CD32) 


Hydrolase, 

Hypothetical 

protein, 

Immunoglobulin 
domain, IgG- 
binding protein, 
Receptor, 
Transmembrane, 
Glycoprotein, 
Signal, Repeat, 
Multigen^ family, 
Polymorphism, 
NAD, One-carbon 
metabolism, Serine 
protease, 
Zymogen , 

D o 

ri OteaSc, 

Alternative splicing, 

Ul If Lm i lvJo«^/i I idl 

translocation, 
Proto-oncogene, 
Galaptin, Lectin, 
Antiaen 


205 


NM_006820 


GS3686 


-4.3 


4.06 


0.69 


chromosome 1 open 
reading frame 29 


Hypothetical 
protein 


213 


NM_001548 


IFIT1 


-3.4 


4.27 


0.71 


Interferon-induced 
protein with 
tetratricopeptide 
repeats 1 


Repeat, TPR 
repeat, Interferon 
induction 


75 


Contig41538_RC 




-2.5 


3.16 


0.68 


ESTs, Moderately similar to hypothetical 
protein FLJ20489 [Homo sapiens] 


307 


NM_016816 


OAS1 


-1.7 


3.29 


0.75 


2\5'-oligoadenylate 
synthetase 1 , 
40/46kDa 


RNA-binding, 
Transferase, 
N ucleotid y Itransfer 
ase, Interferon 
induction , 
Alternative splicing 


255 


Contig51660_RC 




-2.1 


2.65 


0.66 


28kD interferon 
responsive protein 


Transmembrane 


339 


Contig43645_RC 




-4.8 


1.44 


0.63 


Homo sapiens, 
clone 

IMAGE:4428577, 
mRNA, partial cds 


Hypothetical 
protein 


313 


AF026941 




-4.6 


2.71 


0.63 


EST, Weakly similar 
to 2004399A 
chromosomal 
protein [Homo 
sapiens] 


Hypothetical 
protein 


5 


NM_007315 


STAT1 


-3.5 


1.8 


0.59 


signal transducer 
and activator of 
transcription 1, 
91kDa 


Transcription 
regulation, DNA- 
binding, FMucJear 
protein, 

Phosphorylation, 
SH2 donnain, 
Alternative splicing, 
3D-structure 


225 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


/Vvg 
poor 
x:dev 


Correl- 
ation 


Description 


Sp xref keyword 
Jist I 


SEQ ID 


NM_002038 


G1P3 


-4.1 


5.64 


0.79 


interferon, alpha- 
inducible protein 
(clone IFI-6-16) 


Interferon 
induction, 
Transmembrane, 
Signal, Alternative 
splicing 


85 


NM_005101 


ISG15 


-5.6 


5.34 


0.77 


interferon-stimulated 
protein, 15 kDa 


Interferon 
induction, Repeat 


169 


NM_002462 


MX1 


-6.1 


0.83 


0.56 


myxovirus (influenza 
virus) resistance 1 , 
interferon-inducible 
protein p78 (mouse) 


Hypothetical 
protein, Interferon 
induction, GTP- 
binding, Multigene 
family, Antiviral 


103 


NM_005532 


IFI27 


-5.8 


2.81 


0.59 


interferon, alpha- 
inducible protein 27 


Interferon 
induction, 
Transmembrane 


183 


NM_002346 


LY6E 


-2.1 


3.58 


0.75 


lymphocyte antigen 
6 complex, locus E 


Signal, Antigen, 
Multigene family, 
Membrane, GPI- 
anchor 


97 


NM_016817 


0AS2 


-3.6 


1.89 


0.59 


2'-5'-oligoadenylate 

o y I i li iciaov £— , 

69/71 kDa 


RNA-binding, 

T n & ^ e» 

Nucleotidy [transfer 
ase Reoeat 
Interferon 
induction, 

Alternative splicing, 
Myristate 


257 


Contia44909 RC 




-2.3 


1.13 


0.55 


hvDOthetical orotein 
BC012330 


Hvoothetical 
protein 


317 


NMJ)17414 


USP18 


-4.1 


3.37 


0.72 


ubiquitin specific 
protease 1 8 


Ubl conjugation 
pathway, 
Hydrolase, Thiol 
protease, 
Multigene family 


259 


NM_004029 


IRF7 


-2.4 


3.67 


0.66 


interferon regulatory 
factor 7 


Collagen, 
Transcription 
regulation, DNA- 
binding, Nuclear 
protein, Activator, 
Alternative splicing 


135 


NMJD04335 


BST2 


-3.2 


3.22 


0.57 


bone marrow 
stroma! cell antigen 
2 


Transmembrane, 
Glycoprotein, 
Signal-anchor, 
Polymorphism 


145 


NMJ)02759 


PRKR 


-2.4 


1.8 


0.58 


protein kinase, 
interferon-inducible 
double stranded 
RNA dependent 


Transferase, 
Serine/threonine- 
protein kinase, 
ATP-binding, 
Repeat, 

Phosphorylation, 
Interferon 
induction, RNA- 
binding, 3D- 
structure 


105 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp xref keyword 
Jist 


SEQID 


NM 006332 


IFI30 


-3.8 


2.65 


0.64 


interferon, gamma- 
inducible protein 30 


Oxidored uctase, 

Interferon 

induction, 

Glycoprotein, 

Lysosome, Signal, 

Hypothetical 

protein 


203 


NM_009587 


LGALS9 


-3.2 


2.08 


0.6 


lectin, galactoside- 
binding, soluble, 9 
(galectin 9) 


Galaptin, Lectin, 
Repeat, Alternative 
splicing 


227 


NM_003641 


IFITM1 


-2.4 


5.54 


0.63 


interferon induced 
transmembrane 
protein 1 (9-27) 


Interferon 
induction, 
Transmembrane 


127 


NM_0 17523 


HSXIAPA 
F1 


-1 


2.84 


0.7 


XIAP associated 
factor-1 


Hypothetical 
protein 


261 


NM_014314 


RIG-I 


-1.3 


3.55 


0.62 


RNA helicase 


ATP-binding, 

Helicase, 

Hydrolase, 

Hypothetical 

protein 


239 


Contig47563_RC 




-2.2 


3.11 


I 0.56 


ESTs 




1 325 


AI497657_RC 




-4.4 


5.61 


0.74 


guanine nucleotide 
binding protein 4 


Transducer, 
Prenylation, 
Lipoprotein, 
Multigene family 


335 


NM_000735 


CGA 


-4.3 


2.5 


0.58 


glycoprotein 
hormones, alpha 
polypeptide 


Hormone, 
Glycoprotein, 
Signal, 3D- 
structure 


53 


NM_004988 


MAGEA1 


-1.4 


6.31 


0.64 


melanoma antigen, 
family A, 1 (directs 
expression of 
antigen MZ2-E) 


Antigen, Multigene 
family, 

Polymorphism, 
Tumor antigen 


163 


Contig54242_RC 




-1.2 


4.1 


0.65 


chromosome 17 
open reading frame 
26 


Hypothetical 
protein 


347 


NM_004710 


SYNGR2 


-1.4 


3.01 


0.54 


synaptogyrin 2 


Transmembrane 


157 


NM 001168 


BIRC5 


-3.7 


3.39 


0.64 


baculoviral IAP 
repeat-containing 5 
(survivin) 


Hypothetical 
protein, Protease, 
Receptor, 
Apoptosis, Thiol 
protease inhibitor, 
Alternative splicing, 
3D-structure 


63 


Contig41413__RC 




-4.4 


2.61 


0.57 


ribonucleotide 
reductase M2 
polypeptide 


Oxidoreductase, 
DNA replication, 
Iron 


305 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


NM_004203 


PKMYT1 


-3.4 


3.79 


0.6 


retinoblastoma-like 2 
( P 130) 


ATP-binding, 
Kinase, 

Serine/threonine- 
protein kinase, 
Transferase, 
Transcription 
regulation, DNA- 
binding, Nuclear 
protein, Cell cycle, 
Phosphorylation, 
Anti-oncogene 


137 


Contig48913_RC 




-3.1 


1.72 


0.55 


Homo sapiens, Similar to hypothetical 
protein PR0 1722, clone MGC: 15692 
IMAGE :3351479, mRNA, complete cds 


327 


NM_005804 


DDXL 


-2.5 


1.42 


0.58 


DEAD/H (Asp-Glu- 
Ala-Asp/His) box 
polypeptide 39 


ATP-binding, 

Helicase, 

Hydrolase, 

Hvnothetical 

protein 


191 


NM_0 16359 


LOC5120 
3 


-1.7 


3.6 


0.57 


nucleolar protein 
ANKT 


Hypothetical 
protein, Nuclear 
protein 


253 


NM_001645 


APOC1 


-2.9 


3.43 


0.58 


apolipo protein C-l 


Plasma, Lipid 
transport, VLDL, 
Signal, 3D- 
structure, 
Polymorphism 


79 


Contig37895_RC 




-2 


2.05 


0.55 


ESTs 




295 


NM_005749 


TOB1 


-1.3 


4.96 


0.59 


transducer of 
ERBB2, 1 


Phosphorylation 


189 


NMJ300269 


NME1 


-1.3 


2.98 


0.55 


non-metastatic cells 
1, protein (NM23A) 
expressed in 


Transferase, 
Kinase, ATP- 
binding, Nuclear 
protein, Anti- 
oncogene, Disease 
mutation 


39 


NM_0 14462 


LSM1 


-1 


4.5 


0.57 


Lsm1 protein 


Nuclear protein, 
Ribonucleoprotein, 
mRNA splicing, 
mRNA processing, 
RNA-binding 


245 


Contig31221_RC 




-1.4 


3.83 


0.56 


HTPAP protein 




287 


NM_005326 


HAGH 


-1.9 


4.29 


0.57 


hydroxyacyl 

glutathione 

hydrolase 


Hydrolase, Zinc, 
3D-structure 


179 


Contig42342_RC 




0.78 


-3.2 


-0.6 


Homo sapiens cDNA 
FLJ39*17fis, clone 
PLACE6016942 


Hypothetical 
protein 


311 


AL1 37540 


NTN4 


2.24 


-3.9 


-0.6 


netrin 4- 


Laminin EGF-like 
domain, Signal, 
Hypothetical 
protein 


19 


Contig40434_RC 




1.64 


-5.6 


-0.6 


wingless-type MMTV 
integration site 
family, member 5A 


Developmental 
protein, 
Glycoprotein, 
Signal 


301 
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Accession/ 
Contig No. 


Gene 


Avg 

good 

xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Description 


Sp xref keyword 
Jist 


SEQ ID 


Contig 1632JRC 




1.03 


-3.9 


-0.6 


hypothetical protein 
MGC17921 


Hypothetical 
protein 


275 


NM_0 14246 


CELSR1 


0.95 


-4.6 


-0.6 


cadherin, EGF LAG 
seven-pass G-type 
receptor 1 (flamingo 
homolog, 
Drosophila) 


G-protein coupled 

receptor, 

Transmembrane, 

Glycoprotein, EGF- 

like domain, 

Calcium-binding, 

Laminin EGF-like 

domain, Repeat, 

Developmental 

protein, 

Hydroxylation, 

Signal, Alternative 

splicing, 

Hypothetical 

protein 


237 


NM_005139 


ANXA3 


1.26 


-6.2 


-0.6 


annexin A3 


Annexin, 

v^aiciurn/pnoojjr luii 

pid-binding, 

Repeat, 

Phospholipase A2 
inhibitor, 3D- 
structure, 
Polymorphism 


171 



Table 5. Geneset of 65 markers used to classify ER+, ER/AGE low, LN individuals. 



Accession/ 
Contig No. 


Gene 


Avg 

good 

xdev 


Avg 

poor 

xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp xref keywo 
rdjist 


SEQ ID 


M55914 


MPB1 


-2.82 


1.25 


0.5 


EN01 


enolase 1 , (alpha) 


DNA-binding, 

Transcription 

regulation, 

Repressor, 

Nuclear protein, 

Lyase, 

Glycolysis, 

Magnesium, 

Multigene 

family, 

Hypothetical 

protein 


31 


NMJ305945 


MPB1 


-3.06 


1.19 


0.49 


EN01 


Homo sapiens 
enolase 1, (alpha) 
(ENQ1), mRNA. 


Glycolysis, 

Hypothetical 

protein, Lyase, 

Magnesium, 

DNA-binding, 

Transcription 

regulation, 

Repressor, 

Nuclear protein, 

Multigene family 


193 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp xref keywo 
rdjist 


SEQ ID 


NM_001428 


ENOI 


-2.53 


1.18 


0.46 


ENOI 


enolase 1, (alpha) 


DNA-binding, 

Transcription 

regulation, 

Repressor, 

Nuclear protein, 

Lyase, 

Glycolysis, 

Magnesium, 

Multigene 

family, 

Hypothetical 

d rote in 


71 


NM_001216 


CA9 


-4.72 


1.49 


0.6 


CA9 


carbonic 
anhydrase IX 


Lyase, Zinc, 
Transmembrane 
, Glycoprotein, 
Antigen, Signal, 
Nuclear orotein 
Polymorphism 


65 


NM_001124 


ADM 


-5.68 


2.99 


0.56 


ADM 


Adrenomedullin 


Hormone, 
Amidation, 
Cleavage on 
pair of basic 
residues, Signal 


61 


NM_000584 


IL8 


-2.45 


2.04 


0.54 


IL8 


interleukin 8 


Cytokine, 

wl 1 lULaAlO, 

Inflammatory 

response, 

Signal, 

Alternative 

splicing, 3D- 

structure 


49 


D25328 


PFKP 


-4.19 


3.29 


0.56 


PFKP 


Phosphofructo- 
kinase, platelet 


Kinase, 
Transferase, 

fnlVPOlx/^i*? 
vj? i y i y o to , 

Repeat, 

Allosteric 

enzvme 

Phosphorylation, 
Magnesium, 
Multigene family 


25 


NM_006096 


NDRG1 


-5.45 


5.97 


0.77 


NDRG1 


N-myc downstream 
regulated gene 1 


Hypothetical 
protein, Nuclear 
protein, Repeat 


199 


NM_004994 


MMP9 


-5.53 


1.07 


0.49 


MMP9 


matrix 

metalloproteinase 
9 (gelatinase B, 
92kDa gelatinase , 
92kDa type IV 
collagenase) 


Hydrolase, 

Metalloprotease, 

Glycoprotein, 

Zinc, Zymogen, 

Calcium, 

Collagen 

degradation, 

Extracellular 

matrix, Repeat, 

Signal, 

Polymorphism, 
3D-structure ' 


165 
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Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp xref key wo 
rdjist 


SEQ ID 


NM_00331 1 


TSSC3 


-4.57 


5.58 


0.68 


TSSC3 


tumor suppressing subtransferable 
candidate 3 


117" 


NM_006086 


TUBB4 


-5.19 


2.85 


0.59 


TUBB4 


tubulin, beta, 4 


G-protein 

coupled 

receptor, 

i ransmemurane 

, Glycoprotein, 

Phosphorylation, 

Lipoprotein, 

Palmitate, 

Polymorphism, 

ny pun icui/di 

protein, GTP- 
binding, 

Microtubules, 
Multigene family 


197" 


NM_006115 


PRAME 


-4.48 


2.77 


0.61 


PRAME 


preferentially 
expressed antigen 
in melanoma 


Antigen 


201 


NM_004345 


CAMP 


-2.02 


1.37 


0.49 


C/KMP 


Oct LI IcrllUlU II 1 

antimicrobial 
peptide 


r\\ 1 LIUIULIO, OILJ Ileal 




NM_018455 


BM039 


-2.34 


0.76 


0.47 


BM039 


uncharacterized 
bone marrow 
protein BM039 




265 


Contig49169_RC 




-1.17 


1.5 


0.46 


SUV39H2 


suppressor ot 
variegation 3-9 
(Drosophila) 
homolog 2; 
hypothetical protein 
FLJ23414 


rtypotneucai 
protein, Nuclear 
protein 


32^ 


Contig45032_RC 




-1.37 


0.77 


0.45 


FLJ14813 


hypothetical protein 
FLJ14813 


Hypothetical 
protein, ATP- 
binding, Kinase, 

Ocf II lc/li HcvJi In le 

-protein kinase, 
Transferase 


31^ 


NMJ3009 1 7 


P4HA1 


-1.54 


4.31 


0.62 


P4-HA1 


procollagen- 
proline, 2- 
oxoaiutarate 4- 
dioxygenase 
(proline 4- 
hydroxylase), alpha 
polypeptide I 


Dioxygenase, 
Collagen, 
Oxidored uctase, 
Iron, Vitamin C, 
Alternative 
splicing, 
Glycoprotein, 
Endoplasmic 
reticulum, Signal 


57 


NM_002046 


GAPD 


-2.51 


3.42 


0.6 


GAPD 


glyceraldehyde-3- 

phosphate 

dehydrogenase 


Glycolysis, NAD, 
Oxidored uctase, 
Hypothetical 
protein, 

Multigene family 


87 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 

poor 

xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp xref keywo 
rdjist 


SEQ ID 


NMJ300365 


TPI1 


-1.81 


2.94 


0.56 


TPI1 


triosephosphate 
isomerase 1 


Fatty acid 

biosynthesis, 

Gluconeogenesi 

s, Glycolysis, 

Isomerase, 

Pentose shunt, 

Disease 

mutation, 

Polymorphism, 

3D-structure 


45 


NM_014364 


GAPDS 


-1.08 


2.88 


0.58 


GAPDS 


glyceraldehyd e-3- 
phosphate 
dehydrogenase, 
testis-specific 


Glycolysis, 

Oxidoreductase, 

NAD 


243 


NM_005566 


LDHA 


-2.01 


4.01 


0.59 


LDHA 


lactate 

dehydrogenase A 


Oxidoreductase, 
NAD, Glycolysis, 
Multigene 
family, Disease 
mutation, 
Polymorphism 


185 


NM_000291 


PGK1 


-2.28 


1.68 


0.51 


PGK1 


phosphoglyce rate 
kinase 1 


Kinase, 

Transferase, 

Multigene 

family, 

Glycolysis, 

Acetvlation 

Disease 

mutation, 

Polymorphism, 

Hereditary 

hemolytic 

anemia 


41 


NM_016185 


LOC51 1 
55 


-2.33 


2.82 


0.59 


HN1 


hematological and 
neurological 
expressed 1 




251 


NMJ)01168 


BIRC5 


-4.33 


2.78 


0.55 


BIRC5 


baculoviral IAR 
repeat-containing 5 
(survivin) 


Apoptosis, Thiol 

protease 

inhibitor, 

Alternative 

splicing, 3D- 

structure, 

Hypothetical 

protein, 

Protease, 

Receptor 


63 


NMJ302266 


KPNA2 


-3.75 


1.34 


0.47 


KPNA2 


karyopherin alpha 
2 (RAG cohort 1, 
importin alpha 1) 


Transport, 

Protein 

transport, 

Repeat, Nuclear 

protein, 

Polymorphism 


95 


Contig31288_RC 




-2.1 


1.27 


0.5 




ESTs, Weakly similar to hypothetical 
protein FLJ20-489 [Homo sapiens] 
[H.sapiens] 


289 
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Contig No. 


Gene 


Avg 

good 

xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp xref keywo 
rdjist 


SEQ ID 


NM_000269 


NME1 


-2.15 


3.43 


0.55 


NME1 


non-metastatic 
cells 1 , protein 
(NM23A) 
expressed in 


Transferase, 
Kinase, ATP- 
binding, Nuclear 
protein, Anti- 
oncogene, 
Disease 
mutation 


39 ! 


NMJW3158 


STK6 


-1.23 


1.73 


0.45 


STK6 


serine/threonine 
kinase 6 


ATP-binding, 
Kinase, 

Serine/threonine 
-protein kinase, 
Transferase 


113 


NM_007274 


HBACH 


-1.83 


2.73 


0.51 


BACH 


brain acyl-CoA 
hydrolase 


Hydrolase, 
Serine esterase, 
Repeat 


223 


Contig55188_RC 




-2.36 


3.28 


0.47 


FU22341 


hypothetical protein 
FLJ22341 


Hypothetical 
protein 


35-1 


NMJD02061 


GCLM 


-1.06 


1.76 


0.48 


GCLM 


glutamate-cysteine 
ligase, modifier 
subunit 


Ligase, 

Glutathione 

biosynthesis 


89 


NM_004207 


SLC16A 
3 


-3.11 


5.07 


0.67 


SLC16A3 


solute carrier family 
16 (monocarboxylic 
acid transporters), 
member 3 


Transport, 
Symport, 
Transmembrane 
, Multigene 
family 


139 


NMJD00582 


SPP1 


-5.G9 


5.47 


0.53 


SPP1 


secreted 

phosphoprotein 1 
(osteopontin, bone 
sialoprotein I, early 
T-lymphocyte 
activation 1) 


Hypothetical 

protein, 

Glycoprotein, 

Sialic acid, 

Biomineralizatio 

n, Cell adhesion, 

Phosphorylation, 

Signal, 

Alternative 

splicing 


47 


NM_001109 


ADAM8 


-2.5 


3.74 


0.45 


ADAM 8 


a disintegrin and 
metalloproteinase 
domain 8 


Hydrolase, 
Metalloprotease, 
Zinc, Signal, 
Glycoprotein, 
Transmembrane 
, Antigen 


59 j 


D50402 


SLC11A 
1 


-1.05 


3.46 


0.53 


SLC11A1 


solute carrier family 
11 (proton-coupled 
divalent metal ion 
transporters), 
member 1 


Transport, Iron 

transport, 

Transmembrane 

, Glycoprotein, 

Macrophage, 

Polymorphism 


2T 


AL080235 


DKFZP5 

86E162 

1 


-1.23 


1.96 


0.51 


RIS1 


Ras-induced 
senescence 1 


Hypothetical 
protein 


XT 


Contig40552_RC 




-1.26 


3.96 


0.54 


FLJ25348 


hypothetical protein 
FLJ25348 


Hypothetical 
protein 


303 


Contig52490_RC 




-0.64 


3.33 


0.61 


LOC11623 
8 


hypothetical protein 
BC014072 




341 


NMJD06461 


DEEPE 
ST 


-2.1 


1.85 


0.46 


SPAG5 


sperm associated 
antigen 5 


Hypothetical 
protein 


207 
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Accession/ 
Contig No. 


Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp xref keywo 
rdjist 


SEQ ID 


Contig56503_RC 




-4.3 


3.39 


0.55 


MGC9753 


hypothetical gene 
MGC9753 


Hypothetical 
protein 


355 


Contig63525 




-1.91 


3.34 


0.5 


FLJ 13352 


hypothetical protein 
FLJ 13352 


Hypothetical 
protein 


363 


NM_001909 


CTSD 


-0.83 


4.6 


0.51 


CTSD 


oathepsin D 
(lysosomal aspartyl 
protease) 


Hydrolase, 
Aspartyl 
protease, 
Glycoprotein, 

i— y ououi 1 iv?, 

Signal, 

Zymogen, 

Polymorphism, 

Alzheimer's 

disease, 3D- 

structure 


83 


NM_005063 


SCD 


-2.57 


5.15 


0.48 


SCD 


stearoyl-CoA 
desaturase (delta- 

«-/ ucoaiu I aoc j 


Hypothetical 
protein, 

i — i ivjvj|jicaoi 1 1 \\j 

reticulum, Fatty 
acid 

biosynthesis, 
Iron, 

Oxidoreductase, 
Transmembrane 


167 


NM_005165 


ALDOC 


-2.43 


5.02 


0.48 


ALDOC 


aldolase C, 

fructose- 

bisphosphate 


Lyase, Schiff 
base, 

Glycolysis, 
Multigene family 


173 


NM_000363 


TNNI3 


-0.54 


3.58 


0.48 


TNNI3 


troponin I, cardiac 


Hypothetical 
protein, Muscle 
protein, Actin- 

hinHinn 

Acetylation, 

Disease 

mutation, 

Cardiomyopathy 

, Receptor, 

Signal 


43 


AF035284 




-1.63 


3.28 


0.47 


FADS1 


EST 


Heme, 

Hypothetical 

protein 


7 


Contig30875_RC 




-0.88 


3 


0.6 




ESTs 




285 


NM_018487 


HCA112 


-0.7 


3.54 


0.58 


HCA112 


hepatocellular 
carcinoma- 
associated antigen 
112 


Hypothetical 
protein 


269 


NM_001323 


CST6 


-1.63 


3.84 


0.57 


CST6 


cystatin E/M 


Thiol protease 
inhibitor, Signal, 
Glycoprotein 


69 ; 
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Gene 


Avg 
good 
xdev 


Avg 
poor 
xdev 


Correl- 
ation 


Sequence 
name 


Description 


Sp xref keywo 
rdjist 


SEQ ID 


NM_006516 


SLC2A1 


-1.66 


2.22 


0.46 


SLC2A1 


solute carrier family 
2 (facilitated 
glucose 
transporter), 
member 1 


Transmembrane 

Sugar 
transport, 
Transport, 
Glycoprotein, 
Multigene 
family, Disease 
mutation 


209 


NM_007267 


LAK-4P 


-1.04 


3.28 


0.61 


EVIN1 


expressed in 
activated T/LAK 
lymphocytes 


Hypothetical 
protein 


221 


MM_004710 


SYNGR 
2 


-0.84 


4.81 


0.56 


SYNGR2 


synaptogyrin 2 


Transmembrane 


157 


Contig63649__RC 




-1.34 


6.3 


0.75 




ESTs, Weakly simile 
chromosomal protei 
sapiens] [H. sapiens 


ar to 2004399A 
n [Homo 


361 


NM_003376 


VEGF 


-2.12 


2.42 


0.46 


VEGF 


vascular 

endothelial g rowth 
factor 


Hypothetical 
protein, Mitogen, 
Angiogenesis, 
Growth factor, 
Glycoprotein, 
Sianal Heoarin- 
binding, 
Alternative 
splicing, 
Multigene 
family, 3D- 
structure 


119 


MM 0007Q9 


EPO 




4.01 


0.69 


EPO 


prvthronoipti n 


Frvthrocvte 

maturation, 

Glycoprotein, 

Hormone, 

Signal, 

Pharmaceutical, 
3D-structure 


55 


MM 006014 


DXS987 
9E 


-1.85 


3.44 


0.54 


DXS9879E 


DNA segment on chromosome X 
(unique) 9879 expressed sequence 


195 


NM_007183 


PKP3 


-0.91 


4.14 


0.48 


PKP3 


plakophilin 3 


Cell adhesion, 
Cytoskeleton, 
Structural 
protein, Nuclear 
protein, Repeat 


219 


D 13642 


SF3B3 


-0.65 


2.28 


0.48 


SF3B3 


splicing factor 3b, 
subunit3, 130kDa 


Hypothetical 

protein, 

Spliceosome, 

mRNA 

processing, 

mRNA splicing, 

Nuclear protein 


23 


NIM_003756 


EIF3S3 


-1.85 


2.19 


0.46 


EIF3S3 


eukaryotic 
translation in itiation 
factor 3, subunit 3 
gamma, 40kDa 


Initiation factor, 

Protein 

biosynthesis 


129 
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name 
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SEQ ID 


Contig47096_RC 




-0.41 


4.52 


0.54 


PFKFE34 


6-phosphofructo~2- 
kinase/fructose- 
2,6-biphosphatase 
4 


Kinase, 

Multifunctional 
enzyme, 
Transferase, 
Hydrolase, ATP- 
binding, 

Phosphorylation, 
Multigene family 


323 


NM__004209 


SYNGR 
3 


-0.31 


3.67 


0.53 


SYNG R3 


synaptogyrin 3 


Transmembrane 


I 141 


Contig3464_RC 




0.99 


-5.81 


-0.52 




ESTs 




277 


Contig31646_RC 




1.1 


-7.76 


-0.5 


COL1^A1 


collagen, type XIV, 
alpha 1 (undulin) 


Extracellular 
matrix, 

Glycoprotein, 

Hypothetical 

protein, 

Collagen, Signal 


291 


Contig49388__RC 




1.73 


-1.75 


-0.51 


FLJ13322 


hypothetical protein 
FLJ 13322 


Hypothetical 
protein 


331 


Contig41887_RC 




0.37 


-5.74 


-0.47 


L0C12422 
0 


similar to common 
salivary protein 1 


Hypothetical 
protein 


309 



[00104] 



5 .4 DIAGNOSTIC AND PROGNOSTIC METHODS 
5 .4. 1 SAMPLE COLLECTION 
[00105] In the present invention, markers, such as target polynucleotide molecules or 
proteins, are extracted from a sample taken from an individual afflicted with a condition such 
as breast cancer. The sample may be collected in amy clinically acceptable manner, but must 
be collected such that marker-derived polynucleotides (i.e., RNA) are preserved (if gene 
expression is to be measured) or proteins are preserved (if encoded proteins are to be 
measured). For example, mRNA or nucleic acids derived therefrom (i.e., cDNA or amplified 
DNA) are preferably labeled distinguishably from standard or control polynucleotide 
molecules, and both are simultaneously or independently hybridized to a microarray 
comprising some or all of the markers or marker sets or subsets described above. 
Alternatively, mRNA or nucleic acids derived thereirom may be labeled with the same label 
as the standard or control polynucleotide molecules^ wherein the intensity of hybridization of 
each at a particular probe is compared. A sample nxay comprise any clinically relevant tissue 
sample, such as a tumor biopsy or fine needle aspirate, or a sample of bodily fluid, such as 
blood, plasma, serum, lymph, ascitic fluid, cystic fluid, urine or nipple exudate. The sample 
may be taken from a human, or, in a veterinary context, from non-human animals such as 
ruminants, horses, swine or sheep, or from domestic companion animals such as felines and 
canines. 
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[00106] Methods for preparing total and poly(A)+ RNA are well known and are described 
generally in Sambrook et aL, Molecular Cloning - A Laboratory^ Manual (2nd Ed.), 
Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New ~York (1989)) and 
Ausubel et aL, Current Protocols in Molecular Biology, vol. 2, Current Protocols 
Publishing, New York (1994)). 

[00107] RNA may be isolated from eukaryotic cells by procedures that involve lysis of the 
cells and denaturation of the proteins contained therein. Cells of interest include wild-type 
cells {i.e., non-cancerous), drug-exposed wild-type cells, tumor- or tumor-derived cells, 
modified cells, normal or tumor cell line cells, and drug-exposed modified cells. Preferably, 
the cells are breast cancer tumor cells. 

[00108] Additional steps may be employed to remove DNA. Cell lysis may be 
accomplished with a nonionic detergent, followed by microcentrifugartion to remove the 
nuclei and hence the bulk of the cellular DNA. In one embodiment, RNA is extracted from 
cells of the various types of interest using guanidinium thiocyanate lysis followed by CsCl 
centrifugation to separate the RNA from DNA (Chirgwin et aL, Biochemistry 18:5294-5299 
(1979)). Poly(A)+ RNA is selected by selection with oligo-dT cellulose {see Sambrook et 
aL, Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York (1989). Alternatively, separation of 
RNA from DNA can be accomplished by organic extraction, for example, with hot phenol or 
phenol/chloroform/isoamyl alcohol. 

[00109] If desired, RNase inhibitors may be added to the lysis buffer. Likewise, for certain 
cell types, it may be desirable to add a protein denaturation/digestion step to the protocol. 
[001 10] For many applications, it is desirable to preferentially enrich mRNA with respect to 
other cellular RNAs, such as transfer RNA (tRNA) and ribosomal RNJA (rRNA). Most 
mRNAs contain a poly(A) tail at their 3 5 end. This allows them to be enriched by affinity 
chromatography, for example, using oligo(dT) or poly(U) coupled to a solid support, such as 
cellulose or Sephadex™ (see Ausubel et al., Current Protocols in Molecular Biology, 
vol. 2, Current Protocols Publishing, New York (1994). Once bounds poly(A)+ mRNA is 
eluted from the affinity column using 2 mM EDTA/0.1% SDS. 

[001 1 1] The sample of RNA can comprise a plurality of different mRNA molecules, each 
different mRNA molecule having a different nucleotide sequence. In a specific embodiment, 
the mRNA molecules in the RNA sample comprise at least 5, 10, 15, 20, 25, 30, 40 or 50 
different nucleotide sequences. More preferably, the mRNA molecules of the RNA sample 
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comprise mRNA molecules corresponding to each of the marker genes. In another specific 
embodiment, the RNA sample is a mammalian RNA sample. 

[001 12] In a specific embodiment, total RNA or mRNA from cells are used in the methods of 
the invention. The source of the RNA can be cells of a plant or animal, human, mammal, 
primate, non-human animal, dog, cat, mouse, rat, bird, yea.st, eukaryote, prokaryote, etc. In 
specific embodiments, the method of the invention is used with a sample containing total 
mRNA or total RNA from 1 x 10 6 cells or less. In another embodiment, proteins can be 
isolated from the foregoing sources, by methods known in the art, for use in expression 
analysis at the protein level. 

[001 13] Probes to the homologs of the marker sequences disclosed herein can be employed 
preferably when non-human nucleic acid is being assayed. 

[001 14] The methods of the invention may employ any molecule suitable as a marker. For 
example, sets of proteins informative for a particular condition, including a disease, may be 
determined. As for gene-based markers, levels of variations of different proteins in samples 
may be determined for phenotypic or genotypic subsets of the condition, and proteins 
showing significant variation in either level (abundance) or activity, or both, may be 
identified in order to create a set of proteins informative for one or more of these subsets. 
Such proteins may be identified, for example, by use of gel electrophoresis, such as one- 
dimensional polyacrylamide gel electrophoresis, two-dimensional polyacrylamide gel 
electrophoresis, nondenaturing polyacrylamide gel electrophoresis; isoelectric focusing gels, 
etc., by use of antibody arrays, etc. Of course, the particular template(s) used to classify the 
individual depends upon the type(s) of cellular constituents used as markers. For example, 
where nucleic acids (e.g., genes or nucleic acids derived from expressed genes) are used as 
markers, the template comprises nucleic acids (or the level of expression or abundance 
thereof); where proteins are used as markers, the template comprises proteins, for example, 
the level or abundance of those proteins in a set of individuals; etc. 

5 .4.2 USE OF PROGNOSTIC GENESETS FOR BREAST CANCER 
[001 15] According to the present invention, once genesets informative for a plurality of 
subsets of a condition are identified, an individual is classified into one of these subsets and a 
prognosis is made based on the expression of the genes, or their encoded proteins, in the 
geneset for that subset in a breast cancer tumor sample taken from the individual. 
[001 16] For example, a particular hypothetical condition has three relevant phenotypic 
characteristics, A, B and C. In this example, based on these characteristics, genesets 
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informative for prognosis of four patient subsets A + B + ; ^A + ETC + ; A + ETC~; and A" are 
identified by the method described above. Thus, an individual having the condition would 
first be classified according to phenotypes A-C into one of the four patient subsets. In one 
embodiment, therefore, the invention provides for the classification of an individual having a 
condition into one of a plurality of patient subsets, wherein a set of genes informative for 
prognosis for the subset has been identified. A sample is then taken from the individual, and 
the expression of the prognostically-informative genes in the sample is analyzed and 
compared to a control. In various embodiments, the control is the average expression of 
informative genes in a pool of samples taken from good, prognosis individuals classifiable 
into that patient subset; the average expression of informative genes in a pool of samples 
taken from poor prognosis individuals classifiable into that patient subset; a set of 
mathematical values that represent gene expression levels of good prognosis individuals 
classifiable into that patient subset; etc. 

[001 17] In another embodiment, a sample is taken froirt the individual, and the levels of 
expression of the prognostically-informative genes in tlae sample is analyzed. In one 
embodiment, the expression level of each gene can be compared to the expression level of the 
corresponding gene in a control of reference sample to determine a differential expression 
level. The expression profile comprising expression levels or differential expression levels of 
the plurality of genes is then compared to a template profile. In various embodiments, the 
template profile is a good prognosis template comprising the average expression of 
informative genes in samples taken from good prognosis individuals classifiable into that 
patient subset; or a poor prognosis template comprising the average expression of informative 
genes in samples taken from poor prognosis individuals classifiable into that patient subset; 
or a good prognosis profile comprising a set of mathematical values that represent gene 
expression levels of good prognosis individuals classifiable into that patient subset; etc. 
[001 18] In a specific embodiment, the condition is breast cancer, and the phenotypic, 
genotypic and/or clinical classes are: ER~, BRCA1 individuals; ER~~, sporadic individuals; 
ER+, ER/AGE high individuals; ER+, ER/AGE low, KN+ individuals; and ER+, ER/AGE 
low, LN™ individuals. In this embodiment, an individual may be classified as ER+ or ER . 
If the individual is ER~ the individual is additionally classified as having a BRCA1 -type or 
sporadic tumor. ER~ individuals are thus classified as ER - , BRCA1 or ER~, sporadic. 
Alternatively, if the individual is classified as ER+, the individual is additionally classified as 
having a high or low ratio of the log (ratio) of the level of expression of the gene encoding 
the estrogen receptor to the individual's age. Individuals having a low ratio are additionally 
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classified as LN+ or LN-. ER+ individuals are thus classified as ER+, ER/AGE high; ER+, 
ER/AGE low, LN+, or ER+, ER/AGE low, LN~ Of course, the individual's ER status, 
tumor type, age and LN status may be identified in any order, as long as the individual is 
classified into one of these five subsets. 

[001 19] Thus, in one embodiment, the invention provides a method of classify ing an 
individual with a condition as having a good prognosis or a poor prognosis, comprising: (a) 
classifying said individual into one of a plurality of patient classes, said patient classes being 
differentiated by one or more phenotypic, genotypic or clinical characteristics of said 
condition; (b) determining the level of expression of a plurality of genes or their encoded 
proteins in a cell sample taken from the individual relative to a control, said plurality of genes 
or their encoded proteins comprising genes or their encoded proteins in a cell sample taken 
from the individual relative to a control, said plurality of genes or their encoded proteins 
comprising genes or their encoded proteins informative for prognosis of the patient class into 
which said individual is classified; and (c) classifying said individual as having a good 
prognosis or a poor prognosis on the basis of said level of expression. In a specific 
embodiment, said condition is breast cancer, said good prognosis is the non-oocurrence of 
metastases within five years of initial diagnosis, and said poor prognosis is the occurrence of 
metastases within five years of initial diagnosis. In an more specific embodiment, said 
classifying said individual with a condition as having a good prognosis or a poor prognosis is 
carried out by comparing the level expression of each of said plurality of genes or their 
encoded proteins to said average level of expression of each corresponding gene or its 
encoded protein in said control, and classifying said individual as having a good prognosis 
poor prognosis if said level of expression correlates with said average level of expression of 
each of said genes or their encoded proteins in a good prognosis control or a poor prognosis 
control, respectively, more strongly than would be expected by chance. In a more specific 
embodiment of the method, said plurality of patient subsets comprises ER~ BJRCA1 
individuals; ER" sporadic individuals; ER+, ER/AGE high individuals; ER+, ER/AGE low, 
LN+ individuals; and ER+, ER/AGE low, LN~ individuals. In another embodiment, said 
control is the average level of expression of each of said plurality of genes informative for 
prognosis in a pool of tumor samples from individuals classified into said subset who have a 
good prognosis or good outcome, or who have a poor prognosis or good outcome. In another 
specific embodiment, said control is a set of mathematical values representing the average 
level of expression of genes informative for prognosis in tumor samples of individuals 
classifiable into said subset who have a good prognosis, or who have a poor prognosis. 
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[00120] It is evident that the different patient subsets described herein reflect different 
molecular mechanisms of the initiation of tumor formation and metastasis. Thus, the 
genesets listed in tables 1-5 are also useful for diagnosing a person as having a particular type 
of breast cancer in the first instance. Thus, the invention also provides a method of 
diagnosing an individual as having a particular subtype of breast cancer, comprising 
determining the level of expression in a sample from said individual of a plurality of the 
genes for which markers are listed in Tables 1-5; and comparing said expression to a control, 
where said control is representative of the expression of said plurality of genes in a breast 
cancer sample of said subtype of cancer, and on the basis of said comparison, diagnosing the 
individual as having said subtype of breast cancer. In a specific embodiment, said subtype of 
cancer is selected from the group consisting of ER - , BRCA1 type; ER - sporadic type; ER+, 
ER/AGE high type; ER+, ER/AGE low, LN+ type; and ER/AGE low, LN - type. In another 
specific embodiment, said control is the average level of expression of a plurality of the genes 
for which markers are listed in Table 1, Table 2, Table 3, Table 4 or Table 5. In another 
specific example, said comparing comprises determining the similarity of the expression of 
the genes for which markers are listed in each of Tables 1-5 in. said sample taken from said 
individual to a control level of expression of the same genes for each of Tables 1-5, and 
determining whether the level of expression of said genes in said sample is most similar to 
said control expression of the genes for which markers are listed in Table 1, Table 2, Table 3, 
Table 4 or Table 5. 

[00121] In another embodiment, the invention provides a method of classifying an individual 
as having a good prognosis or a poor prognosis, comprising: Ca) classifying said individual as 
ER - BRCA1; ER - sporadic; ER+, ER/AGE high; ER+, ER/AGE low, LN+; or ER+, 
ER/AGE low, LIST; (b) determining the level of expression of a first plurality of genes in a 
cell sample taken from the individual relative to a control, said first plurality of genes 
comprising two of the genes corresponding to the markers Table 1 if said individual is 
classified as ER - BRCA1; Table 2 if said individual is classified as ER~ sporadic; Table 3 if 
said individual is classified as ER+, ER/AGE high; Table 4 if said individual is classified as 
ER+, ER/AGE low, LN+; or Table 5 if said individual is classified as ER+, ER/AGE low, 
LN - wherein said individual is "ER/AGE high" if the ratio of ER expression to age exceeds 
a predetermined value, and "ER/AGE low" if the ratio of ER expression to age does not 
exceed said predetermined value. In a specific embodiment of this method, said 
predetermined value of ER calculated as ER = 0.1 (AGE - 42.5), wherein AGE is the age of 
said individual. In another specific embodiment, said individual is ER - , BRCA1, and said 
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plurality of genes comprises (i.e., contains at least) 1, 2, 3, 4, 5, 10 or all of the genes for 
which markers are listed in Table 1. In another specific embodiment, said individual is ER , 
sporadic, and said plurality of genes comprises (i.e., contains at least) 1, 2, 3, 4, 5, 10 or all of 
the genes for which markers are listed in Table 2. In another specific embodiment, said 
individual is ER+, ER/AGE high, and said plurality of genes comprises (i.e., contains at least) 
1, 2, 3, 4, 5, 10 or all of the genes for which markers are listed in Table 3. In another specific 
embodiment, said individual is ER+, ER/AGE low, LN+, and said plurality of genes 
comprises (i.e., contains at least) 1, 2, 3, 4, 5, 10 or all of the genes for which markers are 
listed in Table 4. In another specific embodiment, said individual is ER+, ER/AGE low, 
LN^, and said plurality of genes comprises (i.e., contains at least) 1, 2, 3, 4-, 5, 10 or all of the 
genes for which markers are listed in Table 5. In another specific embodiment, the method 
additionally comprises determining in said cell sample the level of expression, relative to a 
control, of a second plurality of genes for which markers are not found inXables 1-5, wherein 
said second plurality of genes is informative for prognosis. 

[00122] In one embodiment, the invention provides a method of classifying an individual 
witti a condition as having a good prognosis or a poor prognosis, comprising: (a) classifying 
said individual into one of a plurality of patient classes, said patient classes being 
differentiated by one or more phenotypic, genotypic or clinical characteris-tics of said 
condition; (b) determining the levels of expression of a plurality of genes or their encoded 
proteins in a cell sample taken from the individual, optionally relative to a control, said 
plurality of genes or their encoded proteins comprising genes or their encoded proteins 
informative for prognosis of the patient class into which said individual is classified; and (c) 
classifying said individual as having a good prognosis or a poor prognosis on the basis of said 
levels of expression. In a specific embodiment, said condition is breast cancer, said good 
prognosis is the non-occurrence of metastases within five years of initial diagnosis, and said 
poor prognosis is the occurrence of metastases within five years of initial diagnosis. In an 
more specific embodiment, said classifying said individual with a condition as having a good 
prognosis or a poor prognosis is carried out by comparing the patient's expression profile of 
said plurality of genes or their encoded proteins to a good and/or poor pro gnosis template 
profile of expression levels of said plurality of genes or their encoded proteins, and 
classifying said individual as having a good prognosis or poor prognosis if said patient 
expression profile has a high similarity to a good prognosis template or a poor prognosis 
template, respectively. In a more specific embodiment of the method, said plurality of patient 
subsets comprises ER - , BRCA1 individuals; ER - , sporadic individuals; ER+, ER/AGE high 
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individuals; ER+, ER/AGE low, LN+ individuals; and ER+, ER/AGE low, LN~ individuals. 
In another embodiment, said good prognosis template comprises the average level of 
expression of each of said plurality of genes informative for prognosis in tumor samples from 
individuals classified into said subset who have a good prognosis or good outcome, while 
said poor prognosis template comprises the average level of expression of each of said 
plurality of genes informative for prognosis in tumor samples from individuals classified into 
said subset who have a poor prognosis or poor outcome. In another specific embodiment, 
said good or poor prognosis template is a set of mathematical values representing the average 
level of expression of genes informative for prognosis in tumor samples of individuals 
classifiable into said subset who have a good prognosis, or who have a poor prognosis, 
respectively. 

[00123] It is evident that the different patient subsets described herein reflect different 
molecular mechanisms of the initiation of tumor fonraation and metastasis. Thus, the 
genesets listed in tables 1-5 are also useful for diagnosing a person as having a particular type 
of breast cancer in the first instance. Thus, the invention also provides a method of 
diagnosing an individual as having a particular subtype of breast cancer, comprising 
determining an expression profile of a plurality of the genes for which markers are listed in 
Tables 1-5 in a sample from said individual; and conxparing said expression profile to a 
template profile, where said template is representative of the expression of said plurality of 
genes in a breast cancer sample of said subtype of cancer, and on the basis of said 
comparison, diagnosing the individual as having said subtype of breast cancer. In a specific 
embodiment, said subtype of cancer is selected from the group consisting of ER~, BRCA1 
type; ER~ sporadic type; ER+, ER/AGE high type; ER+, ER/AGE low, LN+ type; and 
ER/AGE low, LN~ type. In another specific embodixnent, said template comprises the 
average levels of expression of a plurality of the genes for which markers are listed in Table 
1, Table 2, Table 3, Table 4 or Table 5. In another specific example, said comparing 
comprises determining the similarity of the expression profile of the genes for which markers 
are listed in each of Tables 1-5 in said sample taken firom said individual to a template profile 
comprising levels of expression of the same genes fox each of Tables 1-5, and determining 
whether the pattern of expression of said genes in said sample is most similar to the pattern of 
expression of the genes for which markers are listed in Table 1, Table 2, Table 3, Table 4 or 
Table 5. 

[00124] In another embodiment, the invention provides a method of classifying an individual 
as having a good prognosis or a poor prognosis, comprising: (a) classifying said individual as 
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ER~ BRCA1; ER" sporadic; ER+, ER/AGE high; ER+, ER/AGE low, LN+; or ER+, 
ER/AGE low, LN~ ; (b) determining an expression profile of a first plurality of genes in a cell 
sample taken from the individual relative to a control, said first plurality of genes comprising 
at least two of the genes corresponding to the markers Table 1 if said individual is classified 
as ER", BRCA1; Table 2 if said individual is classified as ER~ sporadic; Table 3 if said 
individual is classified as ER+, ER/AGE high; Table 4 if said individual is classified as ER+, 
ER/AGE low, LN+; or Table 5 if said individual is classified as ER+, ER/AGE low, LN , 
wherein said individual is "ER/AGE high" if the ER level of the individual exceeds a 
predetermined value, and "ER/AGE low" if the ER level of the individual does not exceed 
said predetermined value. In a specific embodiment of this method, said predetermined value 
of ER is calculated as ER = 0.1 (AGE - 42.5), wherein AGrE is the age of said individual. In 
another specific embodiment, said individual is ER"", BRCA1, and said plurality of genes 
comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for which markers are listed in Table 1. 
In another specific embodiment, said individual is ER~~, sporadic, and said plurality of genes 
comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for which markers are listed in Table 2. 
In another specific embodiment, said individual is ER+, ER/AGE high, and said plurality of 
genes comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for which markers are listed in 
Table 3. In another specific embodiment, said individual is ER+, ER/AGE low, LN+, and 
said plurality of genes comprises at least 1, 2, 3, 4, 5, 10 or all of the genes for which markers 
are listed in Table 4. In another specific embodiment, said individual is ER+, ER/AGE low, 
LN~ and said plurality of genes comprises at least 1, 2, 3 , 4, 5, 10 or all of the genes for 
which markers are listed in Table 5. In another specific embodiment, the method additionally 
comprises determining in said cell sample the level of expression, relative to a control, of a 
second plurality of genes for which markers are not found in Tables 1-5, wherein said second 
plurality of genes is informative for prognosis. 

[00125] Where information is available regarding the LKT status of a breast cancer patient, the 
patient may be identified as having a "very good prognosis," an "intermediate prognosis," or 
a poor prognosis, which enables the refinement of treatment. In one embodiment, the 
invention provides a method of assigning a therapeutic regimen to a breast cancer patient, 
comprising: (a) classifying said patient as having a "poor prognosis," "intermediate 
prognosis," or "very good prognosis" on the basis of the levels of expression of at least five 
genes for which markers are listed in Table 1, Table 2, Table 3, Table 4 or Table 5; and (b) 
assigning said patient a therapeutic regimen, said therapeutic regimen (i) comprising no 
adjuvant chemotherapy if the patient is lymph node negative and is classified as having a 
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good prognosis or an intermediate prognosis, or (ii) comprising chemotherapy if said patient 
has any other combination of lymph node status and expression profile. 
[00126] In another embodiment, a breast cancer patient is assigned a prognosis by a method 
comprising (a) determining the breast cancer patient's age, ER status, LN status and tumor 
type; (b) classifying said patient as ER~ sporadic; ER~ BRCA1; ER+, ER/AGE high; ER+, 
ER/AGE low, LN+; or ER+, ER/AGE low, LN~; (c) determining an expression profile 
comprising at least five genes in a cell sample takzen from said breast cancer patient wherein 
markers for said at least five genes are listed in Table 1 if said patient is classified as ER , 
sporadic; Table 2 if said patient is classified as ER" BRCA1; Table 3 if said patient is 
classified as ER+, ER/AGE high; Table 4 if said patient is classified as ER+, ER/AGE low, 
LN+; or Table 5 if said patient is classified as ER^+, ER/AGE high, LN~; (d) determining the 
similarity of the expression profile of said at least five genes to a template profile comprising 
levels of expression of said at least five genes to obtain a patient similarity value; (e) 
comparing said patient similarity value to selected first and second threshold values of 
similarity, respectively, wherein said second similarity threshold indicates greater similarity 
to said template expression profile than does said first similarity threshold; and (f) classifying 
said breast cancer patient as having a first prognosis if said patient similarity value exceeds 
said second threshold similarity values, a second prognosis if said patient similarity value 
exceeds said first threshold similarity value but does not exceed said second threshold 
similarity value, and a third prognosis if said patient similarity value does not exceed said 
first threshold similarity value. In a specific embodiment of the method, said first prognosis 
is a "very good prognosis," said second prognosis is an "intermediate prognosis," and said 
third prognosis is a "poor prognosis," wherein said breast cancer patient is assigned a 
therapeutic regimen comprising no adjuvant chemotherapy if the patient is lymph node 
negative and is classified as having a good prognosis or an intermediate prognosis, or 
. comprising chemotherapy if said patient has any other combination of lymph node status and 
expression profile. 

[00127] The invention also provides a method of assigning a therapeutic regimen to a breast 
cancer patient, comprising: (a) determining the lymph node status for said patient; (b) 
determining the expression of at least five genes for which markers are listed in Table 5 in a 
cell sample from said patient, thereby generating an expression profile; (c) classifying said 
patient as having a "poor prognosis," "intermediate prognosis," or "very good prognosis" on 
the basis of said expression profile; and (d) assigning said patient a therapeutic regimen, said 
therapeutic regimen comprising no adjuvant chemotherapy if the patient is lymph node 
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negative and is classified as having a good prognosis or an intermediate prognosis, or 
comprising chemotherapy if said patient has any other combination of lymph node statas and 
classification. In a specific embodiment of this method, said therapeutic regimen assigned to 
lymph node negative patients classified as having an "'intermediate prognosis" additionally 
comprises adjuvant hormonal therapy. In another specific embodiment of this method, said 
classifying step (c) is carried out by a method comprising: (a) rank ordering in descendimg 
order a plurality of breast cancer tumor samples that compose a pool of breast cancer tuxnor 
samples by the degree of similarity between the expression profile of said at least five genes 
in each of said tumor samples and the expression profile of said at least five genes across all 
remaining tumor samples that compose said pool, said degree of similarity being expressed as 
a similarity value; (b) determining an acceptable number of false negatives in said classifying 
step, wherein a false negative is a breast cancer patient for whom the expression levels of said 
at least five genes in said cell sample predicts that said breast cancer patient will have no 
distant metastases within the first five years after initial diagnosis, but who has had a distant 
metastasis within the first five years after initial diagnosis; (c) determining a similarity -value 
above which in said rank ordered list said acceptable number of tumor samples or fewex are 
false negatives; (d) selecting said similarity -value determined in step (c) as a first threshold 
similarity value; (e) selecting a second similarity value, greater than said first similarity 
value, as a second threshold similarity value; and (f) determining the similarity between the 
expression profile of said at least five genes in a breast cancer tumor sample from the breast 
cancer patient and the expression profile of said respective at least five genes in said pool, to 
obtain a patient similarity value, wherein if said patient similarity value equals or exceeds 
said second threshold similarity value, said patient is classified as having a "very good 
prognosis"; if said patient similarity value equals or exceeds said first threshold similarity 
value, but is less than said second threshold similarity value, said patient is classified as 
having an "intermediate prognosis"; and if said patient similarity value is less than said first 
threshold similarity value, said patient is classified as having a "poor prognosis." Another 
specific embodiment of this method comprises determining the estrogen receptor (ER) status 
of said patient, wherein if said patient is ER- positive and lymph node negative, said 
therapeutic regimen assigned to said patient additionally comprises adjuvant hormonal 
therapy. 

[00128] A patient in any patient subset or clinical class, e.g., any one of the classes described 
above, can be classified as having a particular prognosis level, e.g., a good prognosis ox a 
poor prognosis, based on the similarity of ttie patient's cellular constituent profile to arx 
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appropriate template profile for the prognosis level of patients in the clinical class. In one 
embodiment, a cellular constituent profile corresponding to a certain prognosis level, e.g., a 
profile comprising measurements of the plurality of cellular constituents representative of 
levels of the cellular constituents in a plurality of patients having the prognosis level is used 
as a template for the prognosis level. For example, a good prognosis template profile 
comprising measurements of the plurality of cellular constituents representative of levels of 
the cellxilar constituents in a plurality of good outcome patients or a poor prognosis template 
profile comprising measurements of the plurality of cellular constituents representative of 
levels of the cellular constituents in a plurality of poor outcome patients, can be used for 
determining whether a patient have good or poor prognosis. Here, a good outcome patient is 
a patient who has non-reoccurrence of metastases within a period of time after initial 
diagnosis, e.g., a period of 1, 2, 3, 4, 5 or 10 years. In contrast, a poor outcome patient is a 
patient who has reoccurrence of metastases within a period of time after initial diagnosis, e.g., 
a period of 1, 2, 3, 4, 5 or 10 years. In a preferred embodiment, both periods are 10 years. 
Tables 1-5 show exemplary template profiles for the respective patient classes. For example, 
the expression profile of a patient with a combination of ER+, ER/AGE low, LN+ can be 
compared with the good prognosis template of Table 4 to determine if the patient has good 
prognosis or poor prognosis. 

[00129] The degree of similarity of the patient's cellular constituent profile to a template of a 
particular prognosis can be used to indicate whether the patient has the particular prognosis. 
For example, a high degree of similarity indicates that the patient has the particular prognosis, 
whereas a low degree of similarity indicates that the patient does not have the particular 
prognosis. In a preferred embodiment, a patient is classified as having a good prognosis 
profile if the patient's cellular constituent profile has a high similarity to a good prognosis 
template and/or has a low similarity to a poor prognosis template. In another embodiment, a 
patient is classified as having a poor prognosis profile if the patient's cellular constituent 
profile has a low similarity to a good prognosis template and/or has a high similarity to a poor 
prognosis template. In embodiments for predicting the responsiveness of a breast cancer 
patient under the age of 55, the patients in the good and poor* outcome patient populations 
used to generate the templates are preferably also under the age of 55 at the time of diagnosis 
of breast cancer. 

[0013O] The degree of similarity between a patient's cellulax constituent profile and a 
template profile can be determined using any method known in the art. In one embodiment, 
the similarity is represented by a correlation coefficient between the patient's profile and the 
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template. In one embodiment, a correlation coefficient above a correlation threshold 
indicates high similarity, whereas a correlation coefficient below the threshold indicates low 
similarity. In preferred embodiments, the correlation threshold is set as 0.3, 0.4, 0.5 or 0.6. 
In another embodiment, similarity between a patient's profile and a template is represented 
by a distance between the patient's profile and the template. In one embodiment, a distance 
below a given value indicates high similarity, whereas a distance equal to or greater than the 
given value indicates low similarity. 

[0013 1] As an illustration, in one embodiment, a template for a good prognosis is defined as 
z x (e.g., a profile consisting of the xdev's listed in the good prognosis column of one of 
Tables 1-5) and/or a template for poor prognosis is defined as z 2 (e.g., a profile consisting of 
the xdev's listed in the poor prognosis column of one of Tables 1-5). Either one or both of 
the two classifier parameters ( P x and P 2 ) can then be used to measure degrees of similarities 
between a patient's profile and the respective templates: F\ measures the similarity between 
the patient's profile y and the good prognosis template z x , and P 2 measures the similarity 
between y and the poor prognosis template z 2 . In embodiments which employ correlation 
coefficients, the correlation coefficient P i can be calculated as: 



where i = 1 and 2. 

[00132] Thus, in one embodiment, y is classified as a good prognosis profile if P x is greater 
than a selected correlation threshold or if P 2 is equal to or less than a selected correlation 
threshold. In another embodiment, y is classified as a poor prognosis profile if P x is less 
than a selected correlation threshold or if P 2 is above a selected correlation threshold. In still 
another embodiment, y is classified as a good prognosis profile if P x is greater than a first 
selected correlation threshold and y is classified as a poor* prognosis profile if P 2 is greater 
than a second selected correlation threshold. 

[00133] In a preferred embodiment, the cellular constituent profile is an expression profile 
comprising measurements of a plurality of transcripts (e.g., measured as mRNAs or cDNAs) 
in a sample derived from a patient, e.g., the plurality of transcripts corresponding to the 
markers in all or a portion of one of Tables 1-5. In this embodiment, the good prognosis 
template can be a good prognosis expression template comprising measurements of the 
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plurality of transcripts representative of expression levels of the transcripts in a plurality of 
good prognosis patients, and the poor prognosis template can be a. poor prognosis expression 
template comprising measurements of the plurality of transcripts representative of expression 
levels of the transcripts in a plurality of poor prognosis patients. In a preferred embodiment, 
measurement of each transcript in the good or poor prognosis expression template is an 
average of expression levels of the transcript in the plurality of good or poor prognosis 
patients, respectively. 

[00134] In another embodiment, the expression profile is a differential expression profile 
comprising differential measurements of the plurality of transcripts in a sample derived from 
the patient versus measurements of the plurality of transcripts in sl control sample. The 
differential measurements can be xdev, log(ratio), error-weighted, log(ratio), or a mean 
subtracted log(intensity) (see, e.g., Stoughton et al. 5 PCT publication WO 00/39339, 
published on July 6, 2000; U.S. Patent Application No. 10/848,755, filed May 18, 2004, by 
Mao et al., attorney docket no: 9301-188-999, each of which is incorporated herein by 
reference in its entirety). 

5.4.3 IMPROVING SENSITIVITY TO EXPRB SSION LEVEL 
DIFFERENCES 

[00135] In using the markers disclosed herein, and, indeed, using any sets of markers, e.g., to 
compare profiles or to differentiate an individual having one phenotype from another 
individual having a second phenotype, one can compare the profile comprising absolute 
expression levels of the markers in a sample to a template; for example, a template 
comprising the average levels of expression of the markers in a plurality of individuals. To 
increase the sensitivity of the comparison, however, the expression level values are preferably 
transformed in a number of ways. Also, to differentiate an individual having one phenotype 
from another individual having a second phenotype using any sets of markers, one can 
compare the absolute expression of each of the markers in a sample to a control; for example, 
the control can be the average level of expression of each of the markers, respectively, in a 
pool of individuals. 

[00136] For example, the expression level of each of the markers can be normalized by the 
average expression level of all markers the expression level of whdch is determined, or by the 
average expression level of a set of control genes. Thus, in one embodiment, the markers are 
represented by probes on a microarray, and the expression level of each of the markers is 
normalized by the mean or median expression level across all of the genes represented on the 
microarray, including any non-marker genes. In a specific embodiment, the normalization is 
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carried out by dividing the median or mean level of expression of all of the genes on the 
microarray. In another embodiment, the expression levels of the markers is normalized by 
the mean or median level of expression of a set of control markers. In a specific 
embodiment, the control markers comprise a set of housekeeping genes. In another specific 
embodiments the normalization is accomplished by dividing by the median or mean 
expression level of the control genes. 

[00137] The sensitivity of a marker-based assay will also b>e increased if the expression 
levels of individual markers are compared to the expression of the same markers in a pool of 
samples. Preferably, the comparison is to the mean or median expression level of each the 
marker genes in the pool of samples. Such a comparison may be accomplished, for example, 
by dividing by the mean or median expression level of the pool for each of the markers from 
the expression level each of the markers in the sample. This has the effect of accentuating the 
relative differences in expression between markers in the sample and markers in the pool as a 
whole, making comparisons more sensitive and more likely to produce meaningful results 
that the use of absolute expression levels alone. The expression level data may be 
transformed in any convenient way; preferably, the expression level data for all is log 
transformed before means or medians are taken. 

[00138] In performing comparisons to a pool, two approaches may be used. First, the 
expression levels of the markers in the sample may be compared to the expression level of 
those markers in the pool, where nucleic acid derived froirx the sample and nucleic acid 
derived from the pool are hybridized during the course of a single experiment. Such an 
approach requires that new pool nucleic acid be generated for each comparison or limited 
numbers of comparisons, and is therefore limited by the amount of nucleic acid available. 
Alternatively, and preferably, the expression levels in a pool, whether normalized and/or 
transformed or not, are stored on a computer, or on compirter-readable media, to be used in 
comparisons to the individual expression level data from the sample (i.e., single-channel 
data). 

[00139] The current invention also provides the following method of classifying a first cell or 
organism as having one of at least two different phenotypes, where the different phenotypes 
comprise a first phenotype and a second phenotype. The level of expression of each of a 
plurality of markers in a first sample from the first cell or organism is compared to the level 
of expression of each of said markers, respectively, in a pooled sample from a plurality of 
cells or organisms, the plurality of cells or organisms comprising different cells or organisms 
exhibiting said at least two different phenotypes, respectively, to produce a first compared 
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value. The first compared value is then compared to a second compared value, wherein said 
second compared value is the product of a method comprising comparing the level of 
expression of each of said markers in a sample from a cell or organism characterized as 
having said first phenotype to the level of expression of each of said markers, respectively, in 
the pooled sample. The first compared value is then compared to a third compared value, 
wherein said third compared value is the product of a method comprising comparing the level 
of expression of each of the markers in a sample from a cell or organism characterized as 
having the second phenotype to the level of expression of each of the markers, respectively, 
in the pooled sample. In specific embodiments, the marker can be a gene, a protein encoded 
by the gene, etc. Optionally, the first compared value can be compared to additional 
compared values, respectively, where each additional compared value is the product of a 
method comprising comparing the level of expression of each of said markers in a sample 
from a cell or organism characterized as having a phenotype different from said first and 
second phenotypes but included among the at least two different phenotypes, to the level of 
expression of each of said genes, respectively, in said pooled sample. Finally, a 
determination is made as to which of said second, third, and, if present, one or more 
additional compared values, said first compared value is most similar, wherein the first cell or 
organism is determined to have the phenotype of the cell or organism used to produce said 
compared value most similar to said first compared value. 

[00140] The sensitivity of a marker-based assay will also be increased if the expression 
levels of individual markers are compared to the expression of tlxe same markers in a control 
sample, e.g., a sample comprises a pool of samples, to generate a differential expression 
profile. Such a comparison may be accomplished, for example, by determining a ratio 
between, expression level of each marker in the sample and the expression level of the 
corresponding marker in the control sample. This has the effect of accentuating the relative 
differences in expression between markers in the sample and markers in the control as a 
whole, making subsequent comparisons to a template more sensitive and more likely to 
produce meaningful results than the use of absolute expression levels alone. The comparison 
may be performed in any convenient way, e.g., by taking difference, ratio, or log(ratio). 
[00141] In performing comparisons to a control sample, two approaches may be used. First, 
the expression levels of the markers in the sample may be compared to the expression level of 
those markers in the control sample, where nucleic acid derived from the sample and nucleic 
acid derived from the control are hybridized during the course of a single experiment. Such 
an approach requires that new control sample of nucleic acid be generated for each 

63 



WO 2005/086891 



PCT/US2005/007894 



comparison or limited numbers of comparisons, and is therefore limited by the amount of 
nucleic acid available. Alternatively, the expression levels in a control sample, whether 
normalized and/or transformed or not, are stored on a computer, or on computer-readable 
media, to be used in comparisons to the individual expression, level data from the sample (i.e., 
single-channel data). 

[00142] The methods of the invention preferably use a control or reference sample, which 
can be any suitable sample against which changes in cellular constituents can be determined. 
In one embodiment, the control or reference sample is generated by pooling together the 
plurality of cellular constituents, e.g., a plurality of transcripts or cDNAs, or a plurality of 
protein species, from a plurality of breast cancer patients. Alternatively, the control or 
reference sample can be generated by pooling together purified or synthesized cellular 
constituents, e.g., a plurality of purified or synthesized transcripts or cDNAs, a plurality of 
purified or synthesized protein species. In one embodiment, synthetic RNAs for each 
transcripts or cDNAs are pooled to form the control or reference sample. Preferably, the 
abundances of synthetic RNAs are approximately the abundances of the corresponding 
transcripts in a real tumor pool. The differential expression of marker genes for each 
individual patient sample is measured against this control sample. In one embodiment, 60- 
mer oligonucleotides corresponding to the probe sequences on. a microarray used to assay the 
expression levels of the diagnostic/prognostic transcripts are synthesized and cloned into 
pBluescript SK- vector (Statagene, La Jolla, CA), adjacent to the T7 promoter sequence. 
Individual clones are isolated, and the sequences of their inserts are verified by DNA 
sequencing. To generate synthetic RNAs, clones are linearized with £coRI and a T7 in vitro 
transcription (IVT) reaction is performed by MegaScript kit (Ambion, Austin, TX), followed 
by DNase treatment of the product. Synthetic RNAs are purified on RNeasy columns 
(Qiagen, Valencia, CA). These synthetic RNAs are transcribed, amplified, labeled, and 
mixed together to make the reference pool. The abundance of those synthetic RNAs are 
chosen to approximate the abundances of the transcripts of the corresponding marker genes in 
the real tumor pool. 

[00143] The current invention provides the following method of classifying a first cell or 
organism as having one of at least two different phenotypes, where the different phenotypes 
comprise a first phenotype and a second phenotype. The level of expression of each of a 
plurality of markers in a first sample from the first cell or organism is compared to the level 
of expression of each of said markers, respectively, in a pooled sample from a plurality of 
cells or organisms, the plurality of cells or organisms comprising different cells or organisms 
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exhibiting said at least two different phenotypes, respectively, to produce a first compared 
value so that a first differential profile comprising a plurality of first compared values for said 
plurality of markers is generated. The first differential profile is then compared to a second 
differential profile comprising a plurality of second compared values, wherein each said 
second compared value is the product of a method comprising comparing the level of 
expression of each of said markers in a sample from a cell or organism characterized as 
having said first phenotype to the level of expression of each of said markers, respectively, in 
the pooled sample. The first differential profile is then compared to a third differential profile 
comprising a plurality of third compared values, wherein each said third compared value is 
the product of a method comprising comparing the level of expression of each of the markers 
in a sample from a cell or organism characterized as having the second phenotype to the level 
of expression of each of the markers, respectively, in the pooled sample. In specific 
embodiments, each marker can be a gene, a protein encoded by the gene, etc. Optionally, the 
first differential profile can be compared to additional expression profiles each of which 
comprising additional compared values, respectively, where each additional compared value 
is the product of a method comprising comparing the level of expression of each of said 
markers in a sample from a cell or organism characterized as having a phenotype different 
from said first and second phenotypes but included among the at least two different 
phenotypes, to the level of expression of each of said genes, respectively, in said pooled 
sample. Finally, a determination is made as to which of said second, third, and, if present, 
one or more additional differential profiles, said first differential profile is most similar, 
wherein the first cell or organism is determined to have the phenotype of the cell or organism 
used to produce said differential profile most similar to said first differential profile. 
[00144] In a specific embodiment of this method, the compared values are each ratios of the 
levels of expression of each of said genes. In another specific embodiment, each of the levels 
of expression of each of the genes in the pooled sample are normalized prior to any of the 
comparing steps. In a more specific embodiment, the normalization of the levels of 
expression is carried out by dividing by the median or mean le-vel of the expression of each of 
the genes or dividing by the mean or median level of expression of one or more housekeeping 
genes in the pooled sample from said cell or organism. In another specific embodiment, the 
normalized levels of expression are subjected to a log transform, and the comparing steps 
comprise subtracting the log transform from the log of the levels of expression of each of the 
genes in the sample. In another specific embodiment, the two or more different phenotypes 
are different stages of a disease or disorder. In still another specific embodiment, the two or 
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more different phenotypes are different prognoses of a disease or disorder. In yet another 
specific embodiment, the levels of expression of each of th.e genes, respectively, in the pooled 
sample or said levels of expression of each of said genes in a sample from the cell or 
organism characterized as having the first phenotype, second phenotype, or said phenotype 
different from said first and second phenotypes, respectively, are stored on a computer or on 
a computer-readable medium. 

[00 145] In another specific embodiment, the two phenotypes are good prognosis and poor 
prognosis. In a more specific embodiment, the two phenotypes are good prognosis and poor 
prognosis for an individual that is identified as having ER~ BRCA1 status, ER~, sporadic 
status, ER+, ER/AGE high status, ER+, ER/AGE low, LN-H status, or ER+, ER/AGE low, 
LN-+- status. 

[00146] In another specific embodiment, the comparison is made between the expression 
profile of the genes in the sample and the expression profile of the same genes in a pool 
representing only one of two or more phenotypes. In the context of prognosis-correlated 
genes, for example, one can compare the expression levels of prognosis-related genes in a 
sample to the average levels of the expression of the same genes in a plurality of "good 
prognosis" samples (as opposed to a plurality of samples th.at include samples from patients 
having poor prognoses and good prognoses). Thus, in this method, a sample is classified as 
having a good prognosis if the expression profile of prognosis-correlated genes exceeds a 
chosen coefficient of correlation to the average "good prognosis" expression profile (e.g., the 
profile comprising average levels of expression of prognosis-correlated genes in samples 
from a plurality of patients having a "good prognosis"). Patients whose expression profiles 
correlate more poorly with the "good prognosis" expression profile (e.g., whose correlation 
coefficient fails to exceed the chosen coefficient) are classified as having a poor prognosis. 
[001 47] Where individuals are classified on the basis of phenotypic, genotypic, or clinical 
characteristics into patient subsets, the pool of samples may- be a pool of samples for the 
phenotype that includes samples representing each of the patient subsets. Alternatively, the 
pool of samples may be a pool of samples for the phenotype representing only the specific 
patient subset. For example, where an individual is classified as ER+, sporadic, the pool of 
samples to which the individual's sample is compared may be a pool of samples from ER+, 
sporadic individuals having a good prognosis only, or may be a pool of samples of 
individuals having a good prognosis, without regard to ER status or mutation type. 
[001 48] The method can be applied to a plurality of patient subsets. For example, in a 
specific embodiment, the phenotype is good prognosis, and the individual is classified into 
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one of the following patient subsets: ER~ BRCA1 status, ER~ sporadic status, ER+, 
ER/AGE high status, ER+, ER/AGE low, LN+ status, or ER+, ER/AGE low, LN+ status. A 
set of markers informative for prognosis for the patient subset into which the individual is 
classified is then used to determine the likely prognosis fox the individual. A sample is 
classified as coming from an individual having a good prognosis if the expression profile of 
prognosis-correlated genes for the particular subset into wiiich the individual is classified 
exceeds a chosen coefficient of correlation to the average ^'good prognosis" expression 
profile (e.g., the levels of expression of prognosis-correlated genes in a plurality of samples 
from patients within the subclass having a "good prognosis"). Patients whose expression 
levels correlate more poorly with the "good prognosis" expression profile (e.g., whose 
correlation coefficient fails to exceed the chosen coefficient) are classified as having a poor 
prognosis. 

[00149] Of course, single-channel data may also be used without specific comparison to a 
mathematical sample pool. For example, a sample may be classified as having a first or a 
secorxd phenotype, wherein the first and second phenotypes are related, by calculating the 
similarity between the expression profile of at least 5 markers in the sample, where the 
markers are correlated with the first or second phenotype, to a first phenotype template and a 
second phenotype template each comprising the expression levels of the same markers, by (a) 
labeling nucleic acids derived from a sample with a fluorophore to obtain a pool of 
fluorophore-labeled nucleic acids; (b) contacting said fluorophore-labeled nucleic acid with a 
microarray under conditions such that hybridization can occur, detecting at each of a plurality 
of discrete loci on the microarray a fluorescent emission signal from said fluorophore-labeled 
nucleic acid that is bound to said microarray under said conditions; and (c) determining the 
similarity of marker gene expression in the individual sample to the first and second 
templates, wherein if said expression is more similar to the first template, the sample is 
classified as having the first phenotype, and if said expression is more similar to the second 
template, the sample is classified as having the second phenotype. 
[01 0O] In a specific embodiment of the above method, tfcie first phenotype is a good 
prognosis of breast cancer, the sample is a sample from an individual that has been classified 
into a patient subset, and the first and second templates are templates for the phenotype for 
the particular patient subset. In a more specific embodiment, for example, the first phenotype 
is a good prognosis, the second phenotype is a poor prognosis, the patient is classified into an 
ER~~, sporadic patient subset, an ER~ BRCA1 subset, an ER+, ER/AGE high subset, an ER+, 
ER/A.GE low, LN+ subset, or an ER+, ER/AGE low, LN-+- subset, and said first and second 
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templates are templates derived from the expression, of the marker genes in individuals 
having a good prognosis and a poor prognosis, respectively, wherein said individuals are all 
of the patient subset into which said patient is classified. 

5.5 DETERMINATION OF MARKER GENE EXPRESSION LEVELS 
5.5.1 METHODS 

[00150] The expression levels of the marker genes in a sample may be determined by any 
means known in the art. The expression level may be determined by isolating and 
determining the level (i.e., amount) of nucleic acid transcribed from each marker gene. 
Alternatively, or additionally, the level of specific proteins encoded by a marker gene may be 
determined. 

[00151] The level of expression of specific marker genes can be accomplished by 
determining the amount of mRNA, or polynucleotides derived therefrom, present in a sample. 
Any method for determining RNA levels can be used. For example, RNA is isolated from a 
sample and separated on an agarose gel. The separated RNA is then transferred to a solid 
support, such as a filter. Nucleic acid probes representing one or more markers are then 
hybridized to the filter by northern hybridization, and the amount of marker-derived RNA is 
determined. Such determination can be visual, or machine-aided, for example, by use of a 
densitometer. Another method of determining RNA levels is by use of a dot-blot or a slot- 
blot. In this method, RNA, or nucleic acid derived tlierefrom, from a sample is labeled. The 
RNA or nucleic acid derived therefrom is then hybridized to a filter containing 
oligonucleotides derived from one or more marker genes, wherein the oligonucleotides are 
placed upon the filter at discrete, easily-identifiable locations. Hybridization, or lack thereof, 
of the labeled RNA to the filter-bound oligonucleotides is determined visually or by 
densitometer. Polynucleotides can be labeled using a radiolabel or a fluorescent (i.e., visible) 
label. 

[00152] These examples are not intended to be limiting; other methods of determining RNA 
abundance are known in the art. 

[00153] The level of expression of particular marker genes may also be assessed by 
determining the level of the specific protein expressed from the marker genes. This can be 
accomplished, for example, by separation of proteins from a sample on a polyacrylamide gel, 
followed by identification of specific marker-derived proteins using antibodies in a western 
blot. Alternatively, proteins can be separated by two-dimensional gel electrophoresis 
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systems. Two-dimensional gel electrophoresis is well-known in the art and typically involves 
isoelectric focusing along a first dimension followed by SDS-PAGE electrophoresis along a 
second dimension. See, e.g., Hames et al, 1990, Gel Electrophoresis of Proteins: A 
Practical Approach, IRL Press, New York; Shevchenko et aL, Proc. Natl Acad. Sci. USA 
93:1440-1445 (1996); Sagliocco et aU Yeast 12:1519-1533 (1996); Lander, Science 
274:536-539 (1996). The resulting electropherograms can be analyzed by numerous 
techniques, including mass spectrometric techniques, western blotting and immunoblot 
analysis using polyclonal and monoclonal antibodies. 

[00154] Alternatively, marker-derived protein levels can be determined by constructing an 
antibody microarray in which binding sites comprise immobilized, preferably monoclonal, 
antibodies specific to a plurality of protein species encoded by the cell genome. Preferably, 
antibodies are present for a substantial fraction of the marker-derived proteins of interest. 
Methods for making monoclonal antibodies are well known (see, e.g., Harlow and Lane, 
1 988, Antibodies: A Laboratory Manual, Cold Spring Harbor, New York, which is 
incorporated in its entirety for all purposes). In one embodiment, monoclonal antibodies are 
raised against synthetic peptide fragments designed based on genomic sequence of the cell. 
With such an antibody array, proteins from the cell are contacted to the array, and their 
binding is assayed with assays known in the art. Generally, the expression, and the level of 
expression, of proteins of diagnostic or prognostic interest can be detected through 
immunohistochemical staining of tissue slices or sections. 

[00155] Finally, expression of marker genes in a number of tissue specimens may be 
characterized using a "tissue array" (Kononen et al., Nat- Med 4(7):844-7 (1998)). In a tissue 
array, multiple tissue samples are assessed on the same microarray. The arrays allow in situ 
detection of RNA and protein levels; consecutive sections allow the analysis of multiple 
samples simultaneously. 

5.5.2 MICROARRAYS 
[00156] In preferred embodiments, polynucleotide micro arrays are used to measure 
expression so that the expression status of each of the markers above is assessed 
simultaneously. Generally, microarrays according to the invention comprise a plurality of 
markers informative for prognosis, or outcome determination, for a particular disease or 
condition, and, in particular, for individuals having specific combinations of genotypic or 
phenotypic characteristics of the disease or condition that are prognosis-informative for 
a particular patient subset). 
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[00157] The microarrays of the invention preferably comprise at least 2, 3, 4, 5, 7, 10, 15, 20, 
25, 30, 35, 40, 45, 50, 75, 100, 150, 200 or more of markers, or all of the markers, or any 
combination of markers, identified as prognosis-informative within a patient subset. The 
actual number of informative markers the micro array comprises will vary depending upon the 
particular condition of interest, the number of markers identified, and, optionally, the number 
of informative markers found to result in the least Type I error, Type II error, or Type I and 
Type II error in determination of prognosis. As used herein, "Type I error" means a false 
positive and 'Type II error" means a false negative; in the example of prognosis of beast 
cancer, Type I error is the mis-characterization of an individual with a good prognosis as 
having a poor prognosis, and Type II error is the mis-characterization of an individual with a 
poor prognosis as having a good prognosis. 

[00158] In specific embodiments, the invention provides polynucleotide arrays in which the 
prognosis markers identified for a particular patient subset comprise at least 50%, 60%, 70%, 
80%, 85%, 90%, 95% or 98% of the probes on said array. In another specific embodiment, 
the microarray comprises a plurality of probes, wherein said plurality of probes comprise 
probes complementary and hybridizable to at least 75% of the prognosis-informative markers 
identified for a particular patient subset. Microarrays of the invention, of course, may 
comprise probes complementary and hybridizable to prognosis-informative markers for a 
plurality of the patient subsets, or for each patient subset, identified for a particular condition. 
In another embodiment, therefore, the microarray of the invention comprises a plurality of 
probes complementary and hybridizable to at least 75% of the prognosis-informative markers 
identified for each patient subset identified for the condition of interest, and wherein said 
probes, in total, are at least 50% of the probes on said microarray. 
[00159] In yet another specific embodiment, microarrays that are used in the methods 
disclosed herein optionally comprise markers additional to at least some of the markers 
identified by the methods disclosed elsewhere herein. For example, in a specific 
embodiment, the microarray is a screening or scanning array as described in Altschuler et al. 9 
International Publication WO 02/18646, published IMarch 7, 2002 and Scherer et aL, 
International Publication WO 02/16650, published February 28, 2002. The scanning and 
screening arrays comprise regularly-spaced, positionally-addressable probes derived from 
genomic nucleic acid sequence, both expressed and unexpressed. Such arrays may comprise 
probes corresponding to a subset of, or all of, the markers identified for the patient subset(s) 
for the condition of interest, and can be used to monitor marker expression in the same way 
as a microarray containing only prognosis-informative markers otherwise identified. 
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[O0160] In yet another specific embodiment, the microarray is a commercially- available 
cDNA microarray that comprises at least five markers identified by the methods described 
herein. Preferably, a commercially-available cDNA microarray comprises all of the markers 
identified by the methods described herein as being informative for a patient subset for a 
particular condition. However, such a microarray may comprise at least 5, 10, 15 or 25 of 
such markers, up to the maximum number of markers identified. 
[O0161] In an embodiment specific to breast cancer, the invention provides for 
oligonucleotide or cDNA arrays comprising probes hj^bridizable to the genes corresponding 
to each of the marker sets described above (i.e., markers informative for ER _ , sporadic 
individuals, markers informative for ERT, BRCA1 individuals, markers informative for ER+, 
ER/AGE high individuals, markers informative for ER/AGE low, LN+ individuals, and 

markers informative for ER+, ER/AGE low, LN" individuals, as shown in Tables 1-5). Any 
of the microarrays described herein may be provided in a sealed container in a kit. 
[O0162] The invention provides microarrays containing probes useful for the prognosis of 
any breast cancer patient, or for breast cancer patients classified into one of a plurality of 
patient subsets. In particular, the invention provides polynucleotide arrays comprising probes 
to a subset or subsets of at least 5, 10, 1 5, 20, 25 or more of the genetic markers, or up to the 
full set of markers, in any of Tables 1-5, which distinguish between patients with good and 
poor prognosis. In certain embodiments, therefore, the invention provides microarrays 
comprising probes for a plurality of the genes for which markers are listed in Tables 1, 2, 3, 4 
or 5. In a specific embodiment, the microarray of the invention comprises 1, 2, 3, 4, 5 or 10 
of the markers in Table 1, at least five of the markers in Table 2; 1, 2, 3, 4, 5 or 10 of the 
markers in Table 3; 1, 2, 3, 4, 5 or 10 of the markers in. Table 4; or 1, 2, 3, 4, 5 or 10 of the 
markers in Table 1. In other embodiments, the microarray comprises probes for 1, 2, 3, 4, 5, 
or 10 of the markers shown in any two, three or four of Tables 1-5, or all of Tables 1-5. In 
other embodiments, the microarray of the invention contains each of the markers in Table 1, 
Table 2, Table 3, Table 4, or Table 5. In another embodiment, the microarray contains all of 
the markers shown in Tables 1-5. In specific embodiments, the array comprises probes 
derived only from the markers listed in Table 1, Table 2, Table 3, Table 4, or Table 5; probes 
derived from any two of Tables 1-5; any three of Tables 1-5; any four of Tables 1-5; or all of 
Tables 1-5. 

[O0163] In other embodiments, the array comprises a plurality of probes derived from 
markers listed in any of Tables 1-5 in combination with a plurality of other probes, derived 
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from markers not listed in any of Tables 1-5, that are identified as informative for the 
prognosis of breast cancer. 

[00164] In specific embodiments, the invention provides polynucleotide arrays in which the 
breast cancer prognosis markers described herein in Tables 1, 2, 3, 4 and/or 5 com/prise at 
least 50% 5 60%, 70%, 80%, 85%, 90%, 95% or 98% of the probes on said array. In another 
specific embodiment, the microarray comprises a plurality of probes, wherein said plurality 
of probes comprise probes complementary and hybridizable to at least 75% of the genes for 
which markers are listed in Table 1; probes complementary and hybridizable to at least 75% 
of the genes for which markers are listed in Table 2; probes complementary and hybridizable 
to at least 75% of the genes for which markers are listed in Table 3 ; probes complementary 
and hybridizable to at least 75% of the genes for which markers are listed in Table 4; and 
probes complementary and hybridizable to at least 75% of the genes for which markers are 
listed in Table 5, wherein said probes, in total, are at least 50% of the probes on said 
microarray. 

[00165] In yet another specific embodiment, microarrays that are used in the mettatods 
disclosed herein optionally comprise markers additional to at least some of the markers listed 
in Tables 1-5. For example, in a specific embodiment, the microarray is a screening or 
scanning array as described in Altschuler et aL, International Publication WO 02/1 8646, 
published March 7, 2002 and Scherer et aL, International Publication WO 02/16650, 
published February 28, 2002. The scanning and screening arrays comprise regulaarly-spaced, 
positionally-addressable probes derived from genomic nucleic acid sequence, both* expressed 
and unexpressed. Such arrays may comprise probes corresponding to a subset of, or all of, 
the markers listed in Tables 1-5, or a subset thereof as described above, and can be used to 
monitor marker expression in the same way as a microarray containing only markers listed in 
Tables 1-5. 

[00166] In yet another specific embodiment, the microarray is a commercially-available 
cDNA microarray that comprises at least five of the markers listed in Tables 1-5. Preferably, 
a commercially- available cDNA microarray comprises all of the markers listed in Tables 1-5. 
However, such a microarray may comprise at least 5, 10, 15 or 25 of the markers in any of 
Tables 1-5, up to the maximum number of markers in a Table, and may comprise all of the 
markers in any one of Tables 1-5, and a subset of another of Tables 1-5, or subsets of each as 
described above. In a specific embodiment of the microarrays used in the methods disclosed 
herein, the markers that are all or a portion of Tables 1-5 make up at least 50%, 60%, 70%, 
80%, 90%, 95% or 98% of the probes on the microarray. 
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[00167] General methods pertaining to the construction of microarrays comprising the 

marker sets and/or subsets above are described in the following sections. 

[00168] 

[00169] 

5.5.2.1 CONSTRUCTION OF MICROARRAYS 
[001 70] Microarrays are prepared by selecting probes which comprise a polynucleotide 
sequence, and then immobilizing such probes to a solid support or surface. For example, the 
probes may comprise DNA sequences, RNA sequences, or copolymer sequences of DNA and 
RNA.. The polynucleotide sequences of the probes may also comprise DNA and/or RNA 
analogues, or combinations thereof. For example, the polynucleotide sequences of the probes 
may te full or partial fragments of genomic DNA. The polynucleotide sequences of the 
probes may also be synthesized nucleotide sequences, such as synthetic oligonucleotide 
sequences. The probe sequences can be synthesized either enzymatically in vivo, 
enzyrnatically in vitro (e.g., by PGR), or non-enzymatically in vitro. 

[001 71] The probe or probes used in the methods of the invention are preferably immobilized 
to a solid support which may be either porous or non-porous. For example, the probes of the 
invention may be polynucleotide sequences which are attached to a nitrocellulose or nylon 
membrane or filter covalently at either the 3 5 or the 5' end of the polynucleotide. Such 
hybridization probes are well known in the art (see, e.g., Sambrook et al., MOLECULAR 
Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York (1989). Alternatively, the solid support or surface may be a 
glass or plastic surface. In a particularly preferred embodiment, hybridization levels are 
measured to microarrays of probes consisting of a solid phase on the surface of which are 
immobilized a population of polynucleotides, such as a population of DNA or DNA mimics, 
or, alternatively, a population of RNA or RNA mimics. The solid phase may^ be a nonporous 
or, optionally, a porous material such as a gel. 

[001 72] In preferred embodiments, a microarray comprises a support or surface with an 
ordered array of binding (e.g., hybridization) sites or "probes" each representing one of the 
marlcers described herein. Preferably the microarrays are addressable arrays., and more 
preferably positionally addressable arrays. More specifically, each probe of the array is 
preferably located at a known, predetermined position on the solid support siach that the 
identity (i.e., the sequence) of each probe can be determined from its position in the array 
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(i.e., on the support or surface). In preferred embodiments, each probe is covalerrtly attached 
to the solid support at a single site. 

[00173] Microarrays can be made in a number of ways, of which several are described 
below. However produced, microarrays share certain characteristics. The aiTays are 
reproducible, allowing multiple copies of a given array to be produced and easily compared 
with each other. Preferably, microarrays are made from materials that are stable vander 
binding (e.g., nucleic acid hybridization) conditions. The microarrays are preferably small, 
e.g., between 1 cm 2 and 25 cm 2 , between 12 cm 2 and 13 cm 2 , or 3 cm 2 . However 5 larger 
arrays are also contemplated, and may be preferable, e.g., for use in screening arrays. 
Preferably, a given binding site or unique set of binding sites in the microarray will 
specifically bind (e.g., hybridize) to the product of a single gene in a cell (e.g., to a specific 
mRNA, or to a specific cDNA derived therefrom). However, in general, other related or 
similar sequences will cross hybridize to a given binding site. 

[00174] The microarrays of the present invention include one or more test probes, each of 
which has a polynucleotide sequence that is complementary to a subsequence ofENA or 
DNA to be detected. Preferably, the position of each probe on the solid surface is known. 
Indeed, the microarrays are preferably positionally addressable arrays. Specifically, each 
probe of the array is preferably located at a known, predetermined position on the solid 
support such that the identity (i.e., the sequence) of each probe can be determined from its 
position on the array (i.e., on the support or surface). 

[00175] According to the invention, the microarray is an array (i.e., a matrix) in which each 
position represents one of the markers described herein. For example, each position can 
contain a DNA or DNA analogue based on genomic DNA to which a particular RNA or 
cDNA transcribed from that genetic marker can specifically hybridize. The DNA or DNA 
analogue can be, e.g., a synthetic oligomer or a gene fragment. In one embodiment, probes 
representing each of the markers is present on the array. In a preferred embodiment, the array 
comprises probes for each of the markers listed in Tables 1-5. 

5.5.2.2 PREPARING PROBES FOR MICROARRAYS 
[00176] As noted above, the "probe" to which a particular polynucleotide molecxile 
specifically hybridizes according to the invention contains a complementary genomic 
polynucleotide sequence. The probes of the microarray preferably consist of nucleotide 
sequences of no more than 1,000 nucleotides. In some embodiments, the probes of the array 
consist of nucleotide sequences of 10 to 1,000 nucleotides. In a preferred embodiment, the 
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nucleotide sequences of the probes are in the range of 10-200 nucleotides in length and are 
genomic sequences of a species of organism, such that a plurality of different probes is 
present, with sequences complementary and thus capable of hybridizing to the genome of 
such a species of organism, sequentially tiled across all or a portion of such genome. In other 
specific embodiments, the probes are in the range of 10-30 nucleotides in lexigth, in the range 
of 10-40 nucleotides in length, in the range of 20-50 nucleotides in length, in the range of 40- 
80 nucleotides in length, in the range of 50-150 nucleotides in length, in the range of 80-120 
nucleotides in length, and most preferably are 60 nucleotides in length. 

[00177] The probes may comprise DNA or DNA "mimics" (e.g., derivatives and analogues) 
corresponding to a portion of an organism's genome. In another embodiment, the probes of 
the microarray are complementary RNA or RNA mimics. DNA mimics are polymers 
composed of subunits capable of specific, Watson-Crick-like hybridization ^with DNA, or of 
specific hybridization with RNA. The nucleic acids can be modified at the fcase moiety, at 
the sugar moiety, or at the phosphate backbone. Exemplary DNA mimics include, e.g., 
phosphorothioates. 

[001 78] DNA can be obtained, e.g., by polymerase chain reaction (PGR) amplification of 
genomic DNA or cloned sequences. PGR primers are preferably chosen based on a known 
sequence of the genome that will result in amplification of specific fragments of genomic 
DNA. Computer programs that are well known in the art are useful in the design of primers 
with the required specificity and optimal amplification properties, such as Oligo version 5.0 
(National Biosciences). Typically each probe on the microarray will be bet-ween 10 bases 
and 50,000 bases, usually between 300 bases and 1,000 bases in length. PCR methods are 
well known in the art, and are described, for example, in Innis et al, eds., PCR Protocols: 
A Guide to Methods and Applications, Academic Press Inc., San Diego, CA (1990). It 
will be apparent to one skilled in the art that controlled robotic systems are -useful for 
isolating and amplifying nucleic acids. 

[00179] An alternative, preferred means for generating the polynucleotide probes of the 
microarray is by synthesis of synthetic polynucleotides or oligonucleotides* e.g., using N- 
phosphonate or phosphoramidite chemistries (Froehler et al, Nucleic Acid JR.es. 14:5399-5407 
(1986); McBride et al, Tetrahedron Lett. 24:246-248 (1983)). Synthetic sequences are 
typically between about 10 and about 500 bases in length, more typically between about 20 
and about 100 bases, and most preferably between about 40 and about 70 bases in length. In 
some embodiments, synthetic nucleic acids include non-natural bases, such as, but by no 
means limited to, inosine. As noted above, nucleic acid analogues may be xised as binding 
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sites for hybridization. An example of a suitable nucleic acid analogue is peptide nucleic acid 
(see, e.g., Egholm etal, Nature 363:566-568 (1993); U.S. Patent No. 5,539,083). 
[00180] Probes are preferably selected using an algorithm that takes into account binding 
energies, base composition., sequence complexity, cross-hybridization binding energies, and 
secondary structure. See Friend et al, International Patent Publication WO 01/05935, 
published January 25, 2001 ; Hughes et ah, Nat. Biotech. 19:342-7 (2001). 
[00181] A skilled artisan will also appreciate that positive control probes, e.g., probes known 
to be complementary and hiybridizable to sequences in the target polynucleotide molecules, 
and negative control probes, e.g., probes known to not be complementary and hyt>ridizable to 
sequences in the target polynucleotide molecules, should be included on the array. In one 
embodiment, positive controls are synthesized along the perimeter of the array. In another 
embodiment, positive controls are synthesized in diagonal stripes across the array. In still 
another embodiment, the reverse complement for each probe is synthesized next to the 
position of the probe to serve as a negative control. In yet another embodiment, sequences 
from other species of organism are used as negative controls or as "spike-in" controls. 

5.5.2.3 ATTACHING PROBES TO THE SOLID SURFACE 
[00182] The probes are attached to a solid support or surface, which may be made, e.g., from 
glass, plastic (e.g., polypropylene, nylon), polyacrylamide, nitrocellulose, gel, or other porous 
or nonporous material. A preferred method for attaching the nucleic acids to a surface is by 
printing on glass plates, as is described generally by Schena et al, Science 270:467-470 

(1995) . This method is especially useful for preparing microarrays of cDNA (See also, 
DeRisi et al, Nature Genetics 14:457-460 (1996); Shalon et al, Genome Res. 6 : 639-645 

(1996) ; and Schena etal, J>roc. Natl. Acad. Sci. U.S.A. 93:10539-11286 (1995)). 
[00183] A second preferred method for making microarrays is by making high-density 
oligonucleotide arrays. Techniques are known for producing arrays containing thousands of 
oligonucleotides complementary to defined sequences, at defined locations on a surface using 
photolithographic techniques for synthesis in situ (see, Fodor etal, 1991, Science 251:767- 
773; Pease etal, 1994, Proc. Natl. Acad. Sci. U.S.A. 91:5022-5026; Lockhart et al, 1996, 
Nature Biotechnology 14:1675; U.S. Patent Nos. 5,578,832; 5,556,752; and 5,51 0 5 270) or 
other methods for rapid synthesis and deposition of defined oligonucleotides (BLanchard et 
al, Biosensors & Bioelectronics 1 1 :687-690). When these methods are used, 
oligonucleotides (e.g., 60-mers) of known sequence are synthesized directly on a surface such 
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as a derivatized glass slide. Usually, the array produced is redundant, with several 
oligonucleotide molecules per RNA. 

[00184] Other methods for making microarrays, e.g., by masking (Maskos and Southern, 
1992, Nuc. Acids. Res. 20: 1679-1684), may also be used. In principle, and as noted supra, 
any type of array, for example, dot blots on a nylon hybridization membrane (see Sambrook 
et al, Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York (1989)) could be used. However, as will 
be recognized by those skilled in the art, very small arrays will frequently be preferred 
because hybridization vohxmes will be smaller. 

[00185] In one embodiment, the arrays of the present invention are prepared by synthesizing 
polynucleotide probes on a support. In such an embodiment, polynucleotide probes are 
attached to the support covalently at either the 3' or the 5' end of the polynucleotide. 
[00186] In a particularly preferred embodiment, microarrays of the invention are 
manufactured by means of an ink jet printing device for oligonucleotide synthesis, e.g., using 
the methods and systems described by Blanchard in U.S. Pat. No. 6,028,189; Blanchard et 
al., 1996, Biosensors and Bioelectronics 1 1 :687-690; Blanchard, 1998, in Synthetic DNA 
Arrays in Genetic Engineering, Vol. 20, J.K. Setlow, Ed., Plenum Press, New York: at pages 
1 1 1-123. Specifically, the oligonucleotide probes in such microarrays are preferably 
synthesized in arrays, e.g.,, on a glass slide, by serially depositing individual nucleotide bases 
in "microdroplets" of a high surface tension solvent such as propylene carbonate. The 
microdroplets have small -volumes (e.g., 100 pL or less, more preferably 50 pL or less) and 
are separated from each other on the microarray (e.g., by hydrophobic domains) to form 
circular surface tension wells which define the locations of the array elements (i.e.* the 
different probes). Microaxrays manufactured by this ink-jet method are typically of high 
density, preferably having a density of at least about 2,500 different probes per 1 cm 2 . The 
polynucleotide probes are attached to the support covalently at either the 3' or the 5' end of 
the polynucleotide. 

5.5.2.4 TARGET POLYNUCLEOTIDE MOLECULES 
[001 87] The polynucleotide molecules which may be analyzed by the present invention (the 
"target polynucleotide molecules") may be from any clinically relevant source, but are 
expressed RNA or a nucleic acid derived therefrom (e.g., cDNA or amplified RNA. derived 
from cDNA that incorporates an RNA polymerase promoter), including naturally occurring 
nucleic acid molecules, as well as synthetic nucleic acid molecules. In one embodiment, the 
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target polynucleotide molecules comprise RNA, including, but by no means limited to, total 
cellular RNA, poly( A) + messenger RNA (mRNA) or fraction thereof, cytoplasmic mRNA, or 
RNA transcribed from cDNA (i.e., cRNA; see, e.g., Linsley & Schelter, U.S. Patent 
Application No. 09/41 1,074, filed October 4, 1999, or U.S. Patent Nos. 5,545,522, 5,891,636, 
or 5,716,785). Methods for preparing total and poly(A) + RNA are well known in the art, and 
are described generally, e.g., in Sambrook et al, Molecular Cloning - A Laboratory 
Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York (1989). In one embodiment, RNA is extracted from cells of the various types of interest 
in this invention using guanidinium thiocyanate lysis followed by CsCl centrifugation 
(Chirgwin et al, 1979, Biochemistry 18:5294-5299). In another embodiment, total RNA is 
extracted using a silica gel-based column, commercially available examples of which include 
RNeasy (Qiagen, Valencia, California) and StrataPrep (Stratagene, La Jolla, California). In 
an alternative embodiment, which is preferred for S. cerevisiae, RNA is extracted from cells 
using phenol and chloroform, as described in Ausubel et al, eds., 1989, Currenjt 
Protocols in Molecular Biology, Vol. Ill, Green Publishing Associates, Inc., John Wiley 
& Sons, Inc., New York, at pp. 13.12.1-13.12.5). Poly(A)+ RNA can be selected, e.g., by 
selection with oligo-dT cellulose or, alternatively, by oligo-dT primed reverse transcription of 
total cellular RNA. In one embodiment, RNA can be fragmented by methods known in the 
art, e.g., by incubation with ZnCl 2 , to generate fragments of RNA. In another embodiment, 
the polynucleotide molecules analyzed by the invention comprise cDNA, or PCIR products of 
amplified RNA or cDNA. 

[00188] In one embodiment, total RNA, mRNA, or nucleic acids derived therefrom, is 
isolated from a sample taken from a person afflicted with breast cancer. Target 
polynucleotide molecules that are poorly expressed in particular cells may be enriched using 
normalization techniques (Bonaldo et al, 1996, Genome Res. 6:791-806). 
[00189] As described above, the target polynucleotides are detectably labeled at one or more 
nucleotides. Any method known in the art may be used to detectably label the target 
polynucleotides. Preferably, this labeling incorporates the label uniformly alorx g the length of 
the RNA, and more preferably, the labeling is carried out at a high degree of efficiency. One 
embodiment for this labeling uses oligo-dT primed reverse transcription to incorporate the 
label; however, conventional methods of this method are biased toward generating 3' end 
fragments. Thus, in a preferred embodiment, random primers (e.g., 9-mers) are used in 
reverse transcription to uniformly incorporate labeled nucleotides over the full length of the 
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target polynucleotides. Alternatively, random primers may be used in conjunction with PCR 
methods or T7 promoter-based in vitro transcription methods in order to amplify the target 
polynucleotides. 

[00190] In a preferred embodiment, the detectable label is a luminescent label. For example, 
fluorescent labels, bioluminescent labels, chemiluminescent labels, and colorimetric labels 
may be used in the present invention. In a highly preferred embodiment, the label is a 
fluorescent label, such as a fluorescein, a phosphor, a rhodamine, or a polymethine dye 
derivative. Examples of commercially available fluorescent labels include, for example, 
fluorescent phosphoramidites such as FluorePrime (Amersham Pharmacia, Piscataway, N.J.), 
Fluoredite (Millipore, Bedford, Mass.), FAM (ABI, Foster City, Calif), and Cy3 or Cy5 
(Amersliam Pharmacia, Piscataway, N.J.). In another embodiment, the detectable label is a 
radiolabeled nucleotide. 

[00191] In a further preferred embodiment, target polynucleotide molecules from a patient 
sample are labeled differentially from target polynucleotide molecules of a standard. The 
standard can comprise target polynucleotide molecules from normal individuals {i.e., those 
not afflicted with breast cancer). In a highly preferred embodiment, the standard comprises 
target polynucleotide molecules pooled from samples from normal individuals or tumor 
samples from individuals having sporadic-type breast tumors. In another embodiment, the 
target polynucleotide molecules are derived from the same individual, but are taken at 
different time points, and thus indicate the efficacy of a treatment by a change in expression 
of the markers, or lack thereof, during and after the course of treatment (i.e., chemotherapy, 
radiation therapy or cryotherapy), wherein a change in the expression of the markers from a 
poor prognosis pattern to a good prognosis pattern indicates that the treatment is efficacious. 
In this embodiment, different timepoints are differentially labeled. 

5.5.2.5 HYBRIDIZATION TO MICRO ARRAY" S 
[00192] Nucleic acid hybridization and wash conditions are chosen so that the target 
polynucleotide molecules specifically bind or specifically hybridize to the complementary 
polynucleotide sequences of the array, preferably to a specific array site, wherein its 
complementary DNA is located. 

[00193] Arrays containing double-stranded probe DNA situated thereon are preferably 
subjected to denaturing conditions to render the DNA single-stranded prior to contacting with 
the target polynucleotide molecules. Arrays containing single-stranded probe DNA (e.g., 
synthetic oligodeoxyribonucleic acids) may need to be denatured prior to contacting with the 
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target polynucleotide molecules, e.g., to remove hairpins or dimers which form due to self 
complementary sequences. 

[00194] Optimal hybridization conditions will depend on the length (e.g., oligomer versus 
polynucleotide greater than 200 bases) and type (e.g., RNA, or DNA) of probe and target 
nucleic acids. One of skill in the art will appreciate that as the oligonucleotides become 
shorter, it may become necessary to adjust their length to achieve a relatively uniform 
melting temperature for satisfactory hybridization results. General parameters for specific 
(i.e., stringent) hybridization conditions for nucleic acids are described in Sambrook et al, 
Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York (1989), and in Ausubel et al, CURRENT 
Protocols in Molecular Biology, vol. 2, Current Protocols Publishing, New York 
(1994). Typical hybridization conditions for the cDNA microarrays of Schena et al. are 
hybridization in 5 X SSC plus 0.2% SDS at 65°C for four hours, followed by washes at 25°C 
in low stringency wash buffer (1 X SSC plus 0.2% SDS), followed by 10 minutes at 25°C in 
higher stringency wash buffer (0.1 X SSC plus 0.2*% SDS) (Schena et al, Proc. Natl. Acad. 
Set U.S.A. 93:10614 (1993)). Useful hybridization conditions are also provided in, e.g., 
Tijessen, 1993, Hybridization With Nucleic Acid Probes, Elsevier Science Publishers 
B.V.; and Kricka, 1992, Nonisotopic DNA Probe Techniques , Academic Press, San 
Diego, CA. 

[00195] Particularly preferred hybridization conditions include hybridization at a temperature 
at or near the mean melting temperature of the probes (e.g., within 51°C, more preferably 
within 21°C) in 1 M NaCl, 50 mM MES buffer (pH 6.5), 0.5% sodium sarcosine and 30% 
forrnamide. 

5.5.2.6 SIGNAL DETECTION AND DATA ANALYSIS 
[00196] When fluorescently labeled probes are used, the fluorescence emissions at each site 
of a microarray may be, preferably, detected by scanning confocal laser microscopy. In one 
embodiment, a separate scan, using the appropriate excitation line, is carried out for each of 
the two fluorophores used. Alternatively, a laser may be used that allows simultaneous 
specimen illumination at wavelengths specific to the two fluorophores and emissions from 
the two fluorophores can be analyzed simultaneously (see Shalon et al, 1996, "A DNA 
microarray system for analyzing complex DNA samples using two-color fluorescent probe 
hybridization," Genome Research 6:639-645, which is incorporated by reference in its 
entirety for all purposes). In a preferred embodiment, the arrays are scanned with a laser 
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fluorescent scanner with a computer controlled X-Y stage and a microscope objective. 
Sequential excitation of the two fluorophores is achieved with a multi-line, mixed gas laser 
and the emitted light is split by wavelength and detected with two photomultiplier tubes. 
Fluorescence laser scanning devices are described in Schena et al, Genome Res. 6:639-645 
(1996), and in other references cited herein. Alternatively, the fiber-optic bundle described 
by Ferguson et al., Nature Biotech. 14:1681-1684- (1996), may be used to monitor mRNA 
abundance levels at a large number of sites simultaneously. 

[00197] Signals are recorded and, in a preferred embodiment, analyzed by computer, e.g., 
using a 12 or 16 bit analog to digital board. In one embodiment the scanned image is 
despeckled using a graphics program (e.g., Hijaak Graphics Suite) and then analyzed using an 
image gridding program that creates a spreadsheet of the average hybridization at each 
wavelength at each site. If necessary, an experimentally determined correction for "cross 
talk" (or overlap) between the channels for the two fluors may be made. For any particular 
hybridization site on the transcript array, a ratio of the emission of the two fluorophores can 
be calculated. The ratio is independent of the absolute expression level of the cognate gene, 
but is useful for genes whose expression is significantly modulated in association with the 
different breast cancer-related condition. 

5.6 THERAPEUTIC REGIMENS SPECIFIC TO PATIENT SUBSETS 
[00198] The benefit of identifying subsets of individuals that have a common condition, 
followed by identification of sets of genes informative for those particular subsets of 
individuals, is that such subdivision and identification tends to more accurately identify the 
subset of genes responsible for, or most closely associated with, a particular form of the 
condition. For example, breast cancer is a complex condition brought about by several 
different molecular mechanisms. ER+ individuals, particularly ER+, ER/AGE high 
individuals, show an increased level of expression of cell cycle-control genes, and the 
expression of these genes is highly informative for prognosis in this patient subset (see 
Examples). In ER~ individuals, however, the expression of these genes is not informative for 
prognosis. 

[00199] The set of informative markers, therefore, can be used to assign a particular course 
of therapy to an individual, e.g., an individual having breast cancer, depending upon the 
condition subset into which the individual is classified. In one embodiment, therefore, the 
invention provides a method of assigning a course of therapy to an individual having a 
condition, said method comprising classifying the individual into one of a plurality of subsets 
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of a condition, wherein a plurality of informative genes has been identified for at least one of 
said subsets; and assigning a course of therapy known or suspected to be effective for treating 
the subset of the condition associated with those genes. In a specific embodiment, said 
condition is breast cancer, said patient subset is ER+, ER/AGE high status, and said course of 
therapy comprises the administration of one or more compounds known or suspected to be 
effective at arresting the cell cycle. In a more specific embodiment, said one or more 
compounds comprises taxol or a vinca alkaloid. 

[00200] Of course, any course of therapy selected or assigned on the basis of the above 
phenotypes and gene expression may be supplemented by other treatments or courses of 
therapy relevant to or known or suspected to be effective in the treatment of the condition. 
For example, the treatment of breast cancer may additionally comprise surgery, either tissue- 
preserving or radical, radiation treatment, chemotherapy other than that suggested by gene 
expression analysis, or any other therapy or treatment known or suspected to be effective. 

5 .7 CLINICAL TRIALS AND EPIDEMIOLOGICAL STUDIES 
[00201] The method of the present invention may also be used to assign individuals to 
categories within a clinical trial, epidemiological study or the like. For example, individuals 
may be distinguished according to a characteristic of a condition, such as the presence or 
absence of specific proteins (e.g., estrogen receptor) or tissue structures (e.g., lymph nodes), 
and with prognosis, and the results of the trial correlated with prognosis. In a specific 
example, the condition is breast cancer, the characteristic is the presence of the estrogen 
receptor, and the outcome is prognosis is the expected reoccurrence or non-reoccurrence of 
metastases within a given period, for example, five years, after initial diagnosis. In another 
specific example, the condition is obesity, the characteristics are 24-hour energy expenditure, 
and the prognosis is the expected occurrence of heart disease or diabetes. In another specific 
example, the condition is a neurodegenerative disease, the characteristic is exposure to a 
particular range of concentration of an environmental toxin, and the prognosis is expected 
occurrence or degree of loss of motor function. In each case, the characteristics and expected 
outcome are used to assign the individual to a category within a clinical trial or 
epidemiological study. 

[00202] Thus, the invention provides a method for assigning an individual to one of a 
plurality of categories in a clinical trial, comprising classifying the individual into one of a 
plurality of condition categories differentiated by at least one genotypic or phenotypic 
characteristic of the condition; determining the level of expression, in a sample derived from 
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said individual, of a plurality of genes informative for said condition category; determining 
whether said level of expression of said plurality of genes indicates that the individual has a 
good prognosis or a poor prognosis; and assigning the individual to a category in a clinical 
trial on the basis of prognosis. 

[00203] In a specific embodiment, the invention provides a method of assigning an 
individual to a category in a breast cancer clinical trial, said method comprising: (a) 
classifying said individual as ERT, BRCA1, EE."", sporadic; ER+, ER/AGE high; ER+, 
ER/AGE low, LN+; or ER+, ER/AGE low, LIST; (b) determining for said individual the level 
of expression of at least two genes for which markers are listed in Table 1 if said individual 
is classified as ER", BRCA1; Table 2 if said individual is classified as ER~ sporadic; Table 3 
if said individual is classified as ER+, ER/AGE high; Table 4 if said individual is classified 
as ER+, ER/AGE low, LN+; or Table 5 if said individual is classified as ER+, ER/AGE low, 
LN"; (c) determining whether said individual has a pattern of expression of said at least two 
genes that correlates with a good prognosis or a poor prognosis; and (d) assigning said 
individual to at least one category in a clinical trial if said individual has a good prognosis, 
and assigning said individual to a second category in said clinical trial if said individual has a 
poor prognosis. In a more specific embodiment, said individual is additionally assigned to a 
category in said clinical trial on the basis of the classification of said individual as determined 
in step (a). In another more specific embodiment, said individual is additionally assigned to a 
category in said clinical trial on the basis of any other clinical, phenotypic or genotypic 
characteristic of breast cancer. In another more specific embodiment, the method 
additionally comprises determining in said cell sample the level of expression, relative to a 
control, of a second plurality of genes for wtdch markers are not found in Tables 1-5, wherein 
said second plurality of genes is informative for prognosis of breast cancer, and determining 
from the expression of said second plurality of genes, in addition to said first plurality of 
genes, whether said individual has a good prognosis or a poor prognosis. 

5.8 KITS 

[00204] The present invention further provides for kits comprising the marker sets described 
above. The components of the kits of the present invention are preferably contained in sealed 
containers. In a preferred embodiment, the kit comprises a microarray ready for 
hybridization to target polynucleotide molecules. In specific embodiments, the kit may 
comprise any of the microarrays described In detail in Section 5.5.2. Where proteins are the 
target molecules, the kit preferably comprises a plurality of antibodies for binding to specific 
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condition-related proteins, and means for identifying such binding (e.g., means for 
performing a sandwich assay, ELISA, RIA, etc.). Such antibodies may be provided, for 
example, individually or as part of an antibody array. The kit may additionally comprise 
software for the data analyses described above, as described in detail in Section 5.9. The kit 
preferably contains one or more control samples. Such a control sample may be an artificial 
population of marker-related or marker-derived polynucleotides suitable for hybridization to 
a microarray, wherein the markers are related to or relevant to the condition of interest (for 
example, breast cancer). The control may also, or alternatively, be a set of expression values 
stored on a computer disk or other storage mediixm. 

[00205] The kits of the invention may be primarily diagnostic in nature; that is, they may 
assist a physician or researcher in determining a characteristic, for example, the prognosis, of 
a condition of interest, the likely response to a therapeutic regimen, the likely outcome of 
exposure to an environmental condition, such as toxin exposure, etc. The kits of the 
invention may also be used to classify individuals, for example, to place individuals into 
different groups in a clinical trial. The use of each kit is determined by the markers, 
microarray s, controls, etc. included. 

[00206] COMPUTER-FACILITATED ANALYSISThe analytic methods described in the 
previous sections can be implemented by use of the following computer systems and 
according to the following programs and methods. A computer system comprises internal 
components linked to external components. The internal components of a typical computer 
system include a processor element interconnected with a main memory. For example, the 
computer system can be based on an Intel 8086-, 80386-, 80486-, Pentium™, or Pentiums- 
based processor with preferably 32 MB or more of main memory. The computer system may 
also be a Macintosh or a Macintosh-based system, but may also be a minicomputer or 
mainframe. 

[00207] The external components preferably include mass storage. This mass storage can be 
one or more hard disks (which are typically packaged together with the processor and 
memory). Such hard disks are preferably of 1 GB or greater storage capacity. Other external 
components include a user interface device, which can be a monitor, together with an 
inputting device, which can be a "mouse", or other graphic input devices, and/or a keyboard. 
A printing device can also be attached to the computer. 

[00208] Typically, a computer system is also linked to network link, which can be part of an 
Ethernet link to other local computer systems, remote computer systems, or wide area 
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communication networks, such as the Internet. This network link allows the computer 
system to share data and processing tasks with other computer systems. 
[00209] Loaded into memory during operation of this system are several software 
components, which are both standard in the art and special to the instant invention. These 
software components collectively cause the computer system to function according to the 
methods of this invention. These software components are typically stored on the mass 
storage device. A software component comprises the operating system, which is responsible 
for managing computer system and its network interconnections. This operating system can 
be, for example, of the Microsoft Windows® family, such as Windows 3.1, Windows 95, 
Windows 98, Windows 2000, or Windows NT, or may be of the Macintosh OS family, or 
may be UNIX, a UNIX derivative such as LINUX, or an operating system specific to a 
minicomputer or mainframe. The software component represents common languages and 
functions conveniently present on this system to assist programs implementing the methods 
specific to this invention. Many high or Ioav level computer languages can be used to 
program the analytic methods of this invention. Instructions can be interpreted during run- 
time or compiled. Preferred languages include C/C++, FORTRAN and JAVA. Most 
preferably, the methods of this invention axe programmed in mathematical software packages 
that allow symbolic entry of equations and high-level specification of processing, including 
some or all of the algorithms to be used, thereby freeing a user of the need to procedurally 
program individual equations or algorithms. Such packages include Mathlab from 
Mathworks (Natick, MA), Mathematica® from Wolfram Research (Champaign, IL), or S- 
Plus® from Math Soft (Cambridge, MA). Specifically, the software component includes the 
analytic methods of the invention as programmed in a procedural language or symbolic 
package. 

[00210] The software to be included with the kit comprises the data analysis methods of the 
invention as disclosed herein. In particular, the software may include mathematical routines 
for marker discovery, including the calculation of similarity values between clinical 
categories {e.g., prognosis) and marker expression. The software may also include 
mathematical routines for calculating the similarity between sample marker expression and 
control marker expression, using array-generated fluorescence data, to determine the clinical 
classification of a sample. 

[0021 1] Additionally, the software may also include mathematical routines for determining 
the prognostic outcome, and recommended therapeutic regimen, for an individual with a 
condition of interest. In the specific example of breast cancer, the mathematical routines 
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would determine the prognostic outcome and recommended therapeutic regimen for an 
individual having breast cancer. Such breast cancer-specific software would include 
instructions for the computer system's processor to receive data structures that include the 
level of expression of five or more of the marker genes listed in any of Tables 1-5 in a breast 
cancer tumor sample obtained from the breast cancer patient; the mean level of expression of 
the same genes in a control or template; and the breast cancer patient's clinical information, 
including age, lymph node status and ER status. The software may additionally include 
mathematical routines for transforming the hybridization data and for calculating the 
similarity between the expression levels for the marker genes in the patient's breast cancer 
tumor sample and a control or template. In a specific embodiment, the software includes 
mathematical routines for calculating a similarity metric, such as a coefficient of correlation, 
representing the similarity between the expression levels for the marker genes in the patient's 
breast cancer tumor sample and the control or template, and expressing the similarity as that 
similarity metric. 

[00212] The software preferably would include decisional routines that integrate the patient's 
clinical and marker gene expression data, and recommend a course of therapy. In one 
embodiment, for example, the software causes the processor unit to receive expression data 
for prognosis-related genes in the patient's tumor sample, calculate a metric of similarity of 
these expression values to the values for the same genes in a template or control, compare 
this similarity metric to a pre-selected similarity metric threshold or thresholds that 
differentiate prognostic groups, assign the patient to the prognostic group, and, on the basis of 
the prognostic group, assign a recommended therapeutic regimen. In a specific example, the 
software additionally causes the processor unit to receive data structures comprising clinical 
information about the breast cancer patient- In a more specific example, such clinical 
information includes the patient's age, estrogen receptor status, and lymph node status. 
[00213] The software preferably causes the processor unit to receive data structures 
comprising relevant phenotypic and/or genotypic characteristics of the particular condition of* 
interest, and/or of an individual having that condition, and classifies the individual into a 
condition subset according to those characteristics. The software then causes the processor to 
receive values for subset-specific markers, to calculate a metric of similarity of the values 
associated with those markers (e.g., level, abundance, activity, etc.) from the individual to a 
control, compare this similarity metric to a pre-selected similarity metric threshold or 
thresholds that differentiate prognostic groups, assign the patient to a prognostic group, and, 
on the basis of the prognostic group, assign a recommended therapeutic regimen. In the 
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specific example of breast cancer and a breast cancer patient, the software, in one 
embodiment, causes the processor unit to receive data structures comprising the patient's age, 
estrogen receptor status, and lymph node status, and on the basis of this data, to classify the 
patient into one of the following patient subsets: ER~~, sporadic; ER , BRCA1 ; ER+, 
AR/AGE high; ER+, ER/AGE low, LN+; or ER+, ER/AGE low, LN~. The software then 
causes the processor to receive expression values for subset-specific prognosis-informative 
gene expression in the patient's tumor sample, calculate a metric of similarity of these 
expression values to the values for the same genes in a patient subset-specific template or 
control, compare this similarity metric to a pre-selected similarity metric threshold or 
thresholds that differentiate prognostic groups, assign the patient to the prognostic group, 
and, on the basis of the prognostic group, assign a recommended therapeutic regimen. 
[00214] Where the control is an expression template comprising expression values for 
marker genes within a group of patients, e.g., breast cancer patients, the control can comprise 
either hybridization data obtained at the same time {i.e., in the same hybridization 
experiment) as the patient's individual hybridization data, or can be a set of hybridization or 
marker expression values stores on a computer, or on computer-readable media. If the latter 
is used, new patient hybridization data for the selected marker genes, obtained from initial or 
follow-up tumor samples, or suspected tumor samples, can be compared to the stored values 
for the same genes without the need for additional control hybridizations. However, the 
software may additionally comprise routines for updating the control data set, e.g., to add 
information from additional breast cancer patients or to remove existing members of the 
control data set, and, consequently, for recalculating the average expression level values that 
comprise the template. In another specific embodiment, said control comprises a set of 
single-channel mean hybridization intensity values for each of said at least five of said genes, 
stored on a computer-readable medium. 

[00215] Clinical data relating to a breast cancer patient, or a patient having another type of 
condition, and used by the computer program products of the invention, can be contained in a 
database of clinical data in which information, on each patient is maintained in a separate 
record, which record may contain any information relevant to the patient, the patient's 
medical history, treatment, prognosis, or participation in a clinical trial or study, including 
expression profile data generated as part of an initial diagnosis or for tracking the progress of 
the condition, for example, breast cancer, during treatment. 

[00216] Thus, one embodiment of the invention provides a computer program product for 
classifying a breast cancer patient according to prognosis, the computer program product for 
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use in conjunction with a computer having a memory and a processor, the computer program 
product comprising a computer readable storage medium having a computer program 
mechanism encoded thereon, wherein said computer program product can be loaded into the 
one or more memory units of a computer and causes the one or more processor units of the 
computer to execute the steps of (a) receiving a first data structure comprising said breast 
cancer patient's age, ER status, LN status and tumor type; (b) classifying said patient as ER~ 
sporadic; ER~ BRCA1; ER+, ER/AGE high; ER+, ER/AGE low, LN+; or ER+, ER/AGE 
low, LN~~; (c) receiving a first data structure comprising the level of expression of at least two 
genes in a cell sample taken from said breast cancer patient wherein markers for said at least 
two genes are listed in Table 1 if said patient is classified as ER~, sporadic; Table 2 if said 
patient is classified as ER", sporadic; Table 3 if said patient is classified as ER+, ER/AGE 
high; Table 4 if said patient is classified as ER+, ER/AGE low, LNT-J-; or Table 5 if said 
patient is classified as ER+, ER/AGE high, LN - ; (d) determining the similarity of the level of 
expression of said at least two genes to control levels of expression of said at least two genes 
to obtain a patient similarity value; (e) comparing said patient similarity value to selected first 
and second threshold values of similarity of said level of expressioxi of said genes to said 
control levels of expression to obtain first and second similarity threshold values, 
respectively, wherein said second similarity threshold indicates greater similarity to said 
control levels of expression than does said first similarity threshold; and (f) classifying said 
breast cancer patient as having a first prognosis if said patient similarity value exceeds said 
first and said second threshold similarity values, a second prognosis if said patient similarity 
value exceeds said first threshold similarity value but does not exceed said second threshold 
similarity value, and a third prognosis if said patient similarity value does not exceed said 
first threshold similarity value or said second threshold similarity value. In a specific 
embodiment of said computer program product, said first threshold value of similarity and 
said second threshold value of similarity are values stored in said computer. In another more 
specific embodiment, said first prognosis is a "very good prognosis," said second prognosis is 
an "intermediate prognosis," and said third prognosis is a "poor prognosis," and wherein said 
computer program mechanism may be loaded into the memory and further cause said one or 
more processor units of said computer to execute the step of assigning said breast cancer 
patient a therapeutic regimen comprising no adjuvant chemotherapy if the patient is lymph 
node negative and is classified as having a good prognosis or an intermediate prognosis, or 
comprising chemotherapy if said patient has any other combination of lymph node status and 
expression profile. In another specific embodiment, said computer program mechanism may 
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be loaded into the memory and further cause said one or more processor units of the 
computer to execute the steps of receiving a data structure comprising clinical data specific to 
said breast cancer patient. In a more specific embodiment, said single-channel hybridization 
intensity values are log transformed. The computer implementation of the method, however, 
may use any desired transformation method. In another specific embodiment, the computer 
program product causes said processing unit to perform said comparing step (e) by 
calculating the difference between the level of expression of each of said genes in said cell 
sample taken from said breast cancer patient and the level of expression of the same genes in 
said control. In another specific embodiment, the computer program product causes said 
processing unit to perform said comparing step (e) by calculating the mean log level of 
expression of each of said genes in said control to obtain a control mean log expression level 
for each gene, calculating the log expression level for each of said genes in a breast cancer 
sample from said breast cancer patient to obtain a patient log expression level, and 
calculating the difference between the patient log expression level and the control mean log 
expression for each of said genes. In another specific embodiment, the computer program 
product causes said processing unit to perform said comparing step (e) by calculating 
similarity between the level of expression of each of said genes in said cell sample taken from 
said breast cancer patient and the level of expression of the same genes in said control, 
wherein said similarity is expressed as a similarity value. In more specific embodiment, said 
similarity value is a correlation coefficient. The similarity value may, however, be expressed 
as any art-known similarity metric. 

[00217] Of course, the above breast cancer-specific examples are not limiting; analogous 
computer systems, software, and data analysis methods may be utilized for any condition of 
interest. For example, analogous software may be used to determine the prognosis of any 
other type of cancer, or of any other non-cancer diseases or conditions, using markers, 
expression level data and controls specific for that cancer, non-cancer disease or condition. 
[00218] In an exemplary implementation, to practice the methods of the present invention, a 
user first loads experimental data into the computer system. These data can be directly 
entered by the user from a monitor, keyboard, or from other computer systems linked by a 
network connection, or on removable storage media such as a CD-ROM, floppy disk (not 
illustrated), tape drive (not illustrated), ZIP® drive (not illustrated) or through the network. 
Next the user causes execution of expression profile analysis software which performs the 
methods of the present invention. 
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[00219] In another exemplary implementation, a user first loads experimental data and/or 
databases into the computer system. This data is loaded into the memory from the storage 
media or from a remote computer, preferably from a dynamic geneset database system, 
through the network. Next the user causes execution of software that performs the steps of 
the present invention. 

[0022O] Additionally, because the data obtained and analyzed in the software and computer 
system products of the invention may be confidential, the software and/or computer system 
preferably comprises access controls or access control routines, such as password protection 
and preferably, particularly if information is to be transmitted between computers, for 
example, over the Internet, encryption of the data by a suitable encryption algorithm (e.g., 
PGP). 

[0022 1] Alternative computer systems and software for implementing tlie analytic methods 
of this invention will be apparent to one of skill in the art and are intended to be 
comprehended within the accompanying claims. In particular, the accompanying claims are 
intended to include the alternative program structures for implementing the methods of this 
invention that will be readily apparent to one of skill in the art. 

6. EXAMPLE: IDENTIFICATION OF PHENOTYPIC SUBSETS AJS1D 

INFORMATIVE GENESETS FOR EACH 
[00222] Materials and Methods 
Tumor Samples : 

[00223] 3 1 1 cohort samples were collected from breast cancer patients - Selection criteria for 
sporadic patients (i.e., those not identified as having a BRCAl-type tumor; n = 291) included: 
primary invasive breast carcinoma less than 5 cm (Tl or T2); no axillary metastases (NO); 
age at diagnosis of less than 55 years; calendar year of diagnosis 1983- 1996; and no previous 
malignancies. All patients were treated by modified radical mastectomy or breast-conserving 
treatment. See van't Veer et ah, Nature 415:530 (2002). Selection criteria for hereditary 
(i.e., -BRCAl-type; n = 20) tumors included: carriers of germline mutation in BRCA1 or 
BRCA2, and primary invasive breast carcinoma, van't Veer, supra. Additionally, for 
development of a classifier for the BRCA1 group, 14BRCA1 samples previously identified 
(see van't Veer, supra) were added to the 20 BRCA1 type samples to increase sample size. 
Those 14 samples also satisfy the conditions that they are ER negative and age less than 55 
years old. 

[00224] Data analysis: 
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[00225] Sample sub-grouping: As shown in FIG. 1, tumor samples were first divided into 
ER+ and ER — branches since this is the dominant gene expression pattern. In the ER~ 
branch, the samples were further divided into "BRCA1 mutation like" and "Sporadic like" 
categories using the expression templates and 100 genes previously identified as optimal for 
determining BRCA1 status. See van't Veer et a/., Nature 415:530 (2O02). In the ER+ 
category, samples were divided by ER vs. age distribution (see below) into two groups, 
"ER/AGE low" and "ER/AGE high." Within the "ER/AGE low" group, samples were 
further divided according to the lymph node status into two sub-groups: lymph node negative 
(0 lymph nodes; LN— ) and positive (> 0 lymph nodes; LN+) group. 

[00226] The result of these divisions was five distinctive sub-groups: "ER - , sporadic" (n = 
52), "ER~, BRCA1" (n = 34), "ER+, ER/AGE high" (n - 83), "ER+ 9 ER/AGE low, LN""" (n 
= 81), and "ER+, ER/AGE low, LN+" (n = 75). A few samples with, a specific ER vs. age 
distribution in "ER+, ER/AGE low, LN+" group were further excluded to develop a 
classifier, see below for details. 

[00227] Estrogen receptor level : Estrogen receptor gene expression level was measured by a 
60mer oligo-raucleotide on a microarray. Since every individual sample was compared to a 
pool of all samples, the ratio to pool was used to measure the relative level. A threshold of 
-0.65 on logio(ratio) was used to separate the ER+ group from ER~ group. See van't Veer et 
ah, Nature 41 5:530 (2002). 

[00228] Groxiping by ER vs. age distribution : Samples were not uniformly distributed in ER 
vs. age space among the ER+ samples (FIG. 2). First, it appeared that the ER level increases 
with age, as there were few samples from young individuals having a high ER expression 
level. For example, in the 35 to 40 years age group, samples having a log(ratio) of ER > 0.2 
are relatively few as compared to the 40 to 45 age group. In the set of samples used, the 40 < 
age < 45 group contains 30 samples having log(ratio) ER values between -0.2 to 0.2, and 28 
samples having values greater than 0.2, whereas the 35 < age < 40 group includes 24 samples 
with values between -0.2 to 0.2, but only 6 samples with values of greater than 0.2 (Fisher's 
exact test P-value: 1%). The increasing ER level with age may simply due to the fact that 
estrogen levels decrease with age, and the estrogen receptor level rises in compensation. 
[00229] There also appeared to be at least two groups of patients, as indicated by the solid 
line separating the two in FIG. 2A. A bimodality test of the separation indicated by the solid 
line yielded P -value < 10" 4 . Each of these two groups has its own trend between the ER level 
and age. The solid line can be approximated by ER= 0.1 (age -42.5). Patients having values 
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above the solid line are referred to as the "ER/AGE high" group, and the patients below the 
line as the "ER/AGE low" group. 
[00230] Prognosis in each group: 

[00231] Feature selection and performance evaluation : For the prognosis in each group, non- 
informative genes were filtered in each group of patients. Specifically, only genes with 
|log, 0 (ratio)| > logio(2) and P-value (for log(ratio) + 0) < 0.01 in more than 3 experiments 
were kept. This step removed all genes that never had any significant change across all 
samples. Trie second step used a leave-one-out cross validation (LOOCV) procedure to 
optimize the number of reporter genes (features) in the classifier and to estimate the 
performance of the classifier in each group. The feature selection was included inside the 
loop of each LOOCV process. The final "optimal" reporter genes were selected using all of 
the "training samples" as the result of "re-substitution" because one classifier was needed for 
each group. 

[00232] Selection of training samples : Only the samples from patients who had metastases 
within 5 years of initial diagnosis (3 years for "ER~ sporadic" samples; i.e., the "poor 
outcome" group), or who were metastases-free with more than 5 years of follow-up time (i.e., 
the "good outcome" group, were used as the training set. Becaiase the average expression 
levels for informative genes among patients who were metastasis-free, or who had early 
metastases, were used as expression templates for prediction, the training samples for the 
ER+ samples were further limited to those samples that could also be correctly classified by 
the first round of LOOCV process. For the "ER~ sporadic" samples, no such iteration was 
done because no improvement was observed. For the "ER~, BRCA1" samples, an iteration 
was done, but the training samples in the second iteration were limited to the correctly 
predicted good outcome samples from the first round of LOOC^V, and all the poor outcome 
samples with metastases time less than 5 years. Further limitation of the poor outcome 
samples was not performed because of the small number of poor samples and the absence of 
improvement by such limitation. In the first round of LOOCV, except for the "ER~, 
sporadic" group, the number of features was fixed at 50 genes. A patient was predicted to 
have a favorable outcome, that is, no metastases within five years of initial diagnosis, if the 
expression of the reporter genes in a sample from the individual was more similar to the 
"average good profile" than the "average poor profile", and a poor outcome, that is, a 
metastasis ^vithin five years, if the expression of the reporter genes in the sample was more 
similar to the "average poor profile" than the "average good profile". 
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[00233] The justification for such an iteration operation is threefold. First, biologically there 
are always a few individuals with specific reasons (different from the vast majority) to stay 
metastases free or to develop metastases. Second, statistically, most groups of patients 
include outliers that don't follow the distribution of the majority of samples. Third, 
methodologically, the iteration operation is very similar to the idea of "boosting", but instead 
of increasing the weights of the samples predicted wrong, emphasis is placed on the well 
behaved samples for selecting features and training the classifier. Since this process was 
used to select "training samples", and the performance was evaluated using the LOOC*V 
(including the feature selection) after the training sample being fixed, there is no issue of 
over- fitting involved in our procedures. This method of iteration is thus more likely to reveal 
the dominant mode to metastases within each group. 

[00234] Error rate and odds ratio, threshold in the final LOOCV : Unless otherwise stated, 
the error rate was the average error rate from two populations: (1) the number of poor 
outcome samples misclassified as good outcome samples, divided by the total number of poor 
outcome samples; and (2) the total number of good outcome samples misclassified as poor 
outcome samples, divided by the total number of good samples. Two odds ratios were 
reported for a given threshold: (1) the overall odds ratio and (2) the 5 year odds ratio. The 5 
year odds ratio was calculated from samples from individuals that were metastases free for 
more than five years, and who experienced metastasis within 5 years. The threshold w^as 
applied to corl — cor2, where "cor 1" stands for the correlation to the "average good profile" 
in the training set, and "cor2" stands for the correlation to the "average poor profile" in the 
training set. 

[00235] The threshold in the final round of LOOCV was defined using the following steps: 
(1) For each of the N sample i left out for training, features based on the training set were 
selected, (2) given a feature set, arx incomplete LOOCV withN-1 samples was performed 
(only the "average poor profile" and "average good profile" is varied depending on wlxether 
the left out sample is in the training set or not), (3) the threshold based on the minimum error 
rate from N-l samples was determined, and that threshold was assigned to sample / in step 
(1), (4) the median threshold from all N samples was taken, and designated the final 
threshold. FIGS. 3-7 present detailed information about classifiers for the 5 groups: "ER~ 
sporadic", "ER~ BRCA1", "ER+, ER/age high", "ER+, ER/age low, LN~", "ER+, ERL/age 
low, LN+". Tables 1-5 (see Section 5.3) list the final optimal reporter genes for each of the 5 
classifiers for each of the five patient subsets. Table 6, below, summarizes the performance 
of each of the five classifiers together with thresholds used in each classifier. 
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[00236] Table 6. Performance of classifiers for each patient subset. 



Classifier 


Optimal # 
of Genes 
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36 
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ER-, sporadic 


20 


-0.01 


Overall 


52 


18 
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7 


29 


7.35 


2.16- 
25.04 


5 year 


45 


16 


5 


6 


18 


9.60 


2.45- 
37.58 


ER-, BRCA1 


10 


-0.37 


Overall 


34 


6 


3 


3 


22 


14.67 


2.34- 
92.11 


5 year 


22 


6 


1 


3 


-12 


24.00 


2.04- 
282.68 



[00237] TP: True positive 



[00238] FP: False positive 
[00239] FN: False negative 
[00240] TN: True negative 

[00241] Classification method: All classifiers described herein, feature selection and 
optimization were included inside the LOOCV loop. Classifier performance was based on 
the LOOCV results. The profile based on the selected features from each patient was 
compared to the "average good profile" and "average poor profile" (by correlation) to 
determine its predicted outcome. 

[00242] Correlation calculation: The correlation between each gene's expression log(ratio) 
and the endpoint data (final outcome) was calculated using the Pearson's correlation 
coefficient. The correlation between each patient's profile and the "average good profile" 
and "average poor profile" ^was the cosine product (no mean subtraction). 
[00243] Results: 

[00244] The comprehensive prognosis strategy was employed on microarray expression 
profiles of 311 patients diagnosed before age 55 that were all part of previous studies 
establishing and validating a 70-gene prognosis profile. See van ! t Veer et ah, JSlature 
415:530 (2002); van de Vij ver et al.,N. Engl J. Med. 347:1999 (2002). In addition, 14 
known BRCA1 samples from the Nature study were included in defining the prognosis 
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classifier for the BRCA1 group. The overview of the stratifications is shown in FIG. 1 . In 
each of the patient subsets, prognosis classifiers were developed and performance was 
evaluated by leave-one-out cross-validation. The biological make up of each of the 
classifiers was also examined. 

[00245] During the process to decide whether a particular clinical parameter should be used 
for the next stratification, our objectives were twofold: (1) identification of homogeneous 
prognosis patterns; and/or (2) improved prognosis in the subsets. Tfciere is a subtle balance 
between these two objectives because smaller groups will likely lead to uniform patterns 
within the group but have increasingly limited predictive power. With the exception of the 
BRCA1 subset, each group in our stratification contained 50 or more samples. 
[00246] The first layer of stratification was based on the estrogen receptor level. It was 
previously observed that estrogen receptor expression has a dominant effect on overall gene 
expression in breast cancer as seen in hierarchical clustering, van 't Veer et aL, Nature 
415:530 (2002); Perou et al., Nature 406:747 (2000); Gruvberger et aL, Cancer Res. 61 :5979 
(2001). In previous analysis up to 2500 genes were significantly correlated with ER 
expression levels in tumor. See, van ! t Veer et aL, Nature 415:530 C2002). According to the 
threshold defined previously (van de Vijver et aL, N Engl. J. Med. 347:1999 (2002)), 
samples were first divided into two groups according to the estrogen receptor level as 
measured by the oligo probe (accession number: NM 000125) on the array; samples with 
log(ratio) > -0.65 belong to the ER+ group, and the rest belong to ER" group). This resulted 
in 239 samples in the ER+ group and 72 samples in the ER~ group. 

[00247] In the ER+ branch it was observed that when displaying ER expression level as a 
function of age, at least two subgroups appeared to exist. (In general, any bimodality in the 
clinical data is useful.) The tumors were stratified according this bimodality {see FIG. 2). 
The group of ER-*- patients having a high ER/AGE ratio was designated the "ER/AGE high" 
group (83 samples), and the remaining group of patients was designated "ER/AGE low" 
group (156 samples). 

[00248] Within the "ER/age high" group, a group of prognosis reporter genes that highly 
correlated with th.e outcome is identified (see Table 3). Moreover, the expression of these 
genes appeared to be very homogeneous, as indicated by high similarity in expression among 
those genes. See FIG 2A. Leave-one-out cross validation including reporter selection 
yielded an odds ratio of 14.6 (95%CI: 4.7-45.4) and 5 year odds ratio of 24.0 (95%CI: 6.0- 
95.5). Examination of those reporter genes reveals they are mostly the cell cycle genes which 
are highly expressed in the poor outcome tumors. It is worth noting that even though this 
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group includes LN+ and LN- individuals, and mixed treatment, the incidence of distant 
metastases is predicted by a biologically uniform set of genes, possibly indicating that 
proliferation is the prime driving force for disease progression. Also even though variation in 
these genes is observed in other tumor subgroups this is generally not correlated with 
outcome in those settings (see below). 

[00249] In the "ER/age low" group, no predictive pattern was found in the whole group; thus, 
the samples were further stratified into LN- (81 samples, referred to as "ER/age low LN~") 
and LN+ (75 samples, referred to as "ER/age low LN+-") group. 

[00250] Within the "ER/age low LN~" group, a group of genes was identified that was 
uniformly co-regulated, and which correlated with the outcome. Leave-one-out cross- 
validation (including feature selection) yielded an odds ratio of 21.4 (95% CI: 6.0-76.5) and 5 
year odds ratio of 29.2 (95% CI: 6.7-126.3). This group of genes is also enriched for 
individual biological functions (see below). 

[00251] For the "ER/age low LN+" subset, an informative set of genes (see Table 4) was 
obtained after exclusion of several samples from older individuals having low ER levels. 
These samples are indicated in FIG. 2A as those lying below the dashed line (approximated 
as ER < 0. 1 * (age -50). 56 samples remained after the exclusion. This sample set allowed the 
identification of a group of genes with a highly homogeneous pattern that is useful for 
prognosis (overall odds ratio: 11.4 (2.5-50.9), 5 year odds ratio: 15.0 (2.6-87.6)). This 
suggests again that ER vs. age is an important combination for stratifying breast cancer 
patients. The reporter genes involved in this classifier also correlated with the clinical 
measure of the degree of lymphocytic infiltration (data not shown). The prediction in this 
group was not as strong as other positive groups, which may indicate the primary tumor 
carries weaker information about the metastases for this group of patients, and the metastases 
may be started from or influenced by tumors already in lymph nodes. 

[00252] In the ER" branch, because a portion of the samples are "BRCAl-likz" it is natural 
to divide the samples into "BRCAl-like" and "sporadic like". To perform the classification, 
the BRCA1 /sporadic tumor type classifier described hx Roberts et aL, "Diagnosis and 
Prognosis of Breast Cancer Patients," International Pixtlication No. WO 02/103320, which is 
hereby incorporated by reference in its entirety, to segregate the ER~ cohort samples. 52 out 
of the 72 ER~~ samples were found to be "sporadic like" and 20 were found to be "BRCA1- 
like". Interestingly, the "sporadic like" group was enriched for erbb2 mutations (data not 
shown). 
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[00253] Within the "ER~ sporadic" group, no homogeneous prognosis pattern was 
identified; however, 20 genes were identified that are highly predictive of the tumor outcome 
(see Table 2). Leave-one-out cross-validation including feature selection yielded an odds 
ratio of 7.4 (95% CI 2.2-25.0) and 5 year odds ratio 9.6 (2.5 - 37.6). This result represents a 
significant improvement in prognosis compared to the previously-identified 70 gene 
prognosis classifier (see Roberts et al, International Publication No. WO 02/103320; van f t 
Veer et al 9 Nature 415:530 (2002)) which has no within-group prognostic power for the ER~ 
patient subset. The fact that 20 genes predict outcome and that there is no homogeneous (and 
apparent biological) pattern in this group probably indicates multiple mechanisms of 
metastasis in this group. Gene annotation indicates that genes included may be involved in 
invasion, energy metabolism and other functions. 

[00254] For the "ER~ BRCA1 -like" group, we added 14 BRCA1 mutation carrier samples 
from a previous study were added to increase the number of samples. Those 14 extra 
samples also satisfied the following selection criteria: ER negative and age less than 55 y-ears. 
The leave-one-out cross validation process identified 10 genes that are predictive of final 
outcomes. The overall odds ratio is 14.7 (95% CI: 2.3-92.1) and the 5 year odds ratio is 24.0 
(95% CI: 2.0-282.7). 

[00255] Because no homogeneous gene expression patterns were found in ER" branch, the 
predictive power of those genes was further validated. One means of further validation was 
to review the different classifier gene sets for biological interpretations and to identify genes 
within each classifier that gave indications as to the origins of the tumors. 
[00256] The "ER+, ER/AGE high" groixp yielded a classifier highly enriched for cell cycle 
genes with both Gl/S and G2/M phases represented. In this group, over-expression of 46 of 
the 50 genes was associated with disease progression including all the known cell cycle 
genes. This is consistent with rapid growth being the determinant of metastatic potential. 
Four genes in this classifier were anti-correlated with outcome and cell cycle. One of these 
genes encodes follistatin, which binds to and inhibits activin and other members of the TGFp 
family (Lin et aL, Reproduction 126:133 (2003)), the members of which have many 
functions, including growth stimulation. Tumor grade also accurately predicted metastatic 
potential in this group (overall odds ratio: 5.9, 95% CI: 2.0-18.0, 5 year odds ratio: 12.5, 95% 
CI: 2.6-59.3) and was also correlated with the expression level of these genes, which is 
consistent with rate of growth being the primary determinant of disease progression. This set 
of genes had a significantly lower correlation with outcome in the other patient subsets, even 
though coordinate and similarly variable expression was seen. For example, many tumoxs in 
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the 4<i ER~, sporadic" group had high cell cycle and low FST expression, but the expression of 
these genes in these groups was minimally correlated with outcome, indicating that growth 
was not the primary determinant of outcome here (see FIGS. 8 A and 8B). 
[00257] The ER+, ER/ACrE low, LN~ group yielded a classifier rich in both genes for 
glycolytic enzymes (12 of 56) and genes induced by hypoxia and/or angiogenesis (14 of 56) 
with 5 genes falling into both categories. These genes were positively correlated with poor 
outcome, implying that energy metabolism (glycolysis), angiogenesis and adaptation to 
hypoxia were critical pathways in this subgroup of tumors. None of these genes appeared in 
the classifiers for the other patient subsets, and there was a much reduced predictive value of 
these genes in the other tumors, even though coordinate and similarly variable expression was 
seen (see FIG. 8C and 8D). 

[00258] The implication of the above analyses is that certain well known functions (growth, 
angiogenesis, energy metabolism) are important in certain tumor types and not in others, and 
therefore therapies that target these functions will be likely be similarly effective in some 
tumor subgroups and not in others. For example therapies that target cell cycle progression, 
such as taxol or the vinca alkaloids, may be optimally effective in the ER+ ? ER/AGE high 
group, where overexpression of cell cycle genes predominates in the classifier. In contrast, 
tumor subgroups in which variation in cell cycle expression is not correlated with outcome 
may be less sensitive to taxol or the vinca alkaloids. 

[00259] The "comprehensive prognosis" approach significantly improved the prediction error 
rate when compared with 70 gene classifier (Table 7). To make the comparison fair, we 
listed two sets of results from the 70 gene classifier. The first results from the use of the 
same threshold applied to edl the patient subsets (threshold previously optimized for false 
negative rate); the second one results from the use of a threshold optimized for each patient 
subset (optimized for average error rate). The comprehensive approach lowered the error rate 
by at least 6%. 

Table 7. Average error rate for the patient subset approach compared witih. the previously- 



described 70 gene classifier. 



Prognosis method 


over all error rate 


5 year error rate 


70 gene, fix thresh 


30.90% 


25.70% 


70 gene, opt thresh 


28.60% 


27.60% 


Comprehensive 


21.50% 


19.30% 



[00260] Fix thresh: use of a fixed threshold in the classifier as previously determined. 
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[00261] Opt threshold: use of a threshold optimized for each sub-group. For the "ER/Age 
low, LN+" subgroup, 56 samples used for developing the classifiex were included here, 
resulted in 306 samples in total. 
[00262] 

7. REFERENCES CITED 

[00263] All references cited herein are incorporated herein by reference in their entirety and 
for all purposes to the same extent as if each individual publication or patent or patent 
application was specifically and individually indicated to be incorporated by reference in its 
entirety for all purposes. 

[00264] Many modifications and variations of the present invention can be made without 
departing from its spirit and scope, as will be apparent to those skilled in the art. The specific 
embodiments described herein are offered by way of example only, and the invention is to be 
limited only by the terms of the appended claims along with the full scope of equivalents to 
which such claims are entitled. 
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WHAT IS CLAIMED IS : 

1 . A method of identifying a set of informative genes or markers for a 
condition comprising a plurality of phenotypic or genotypic characteristics, comprising: 

(a) classifying each of a plurality of* samples or individuals on the basis of 
one or more phenotypic or genotypic characteristics of said condition into a plurality 
of first classes; and 

(b) identifying within each of said first classes a first set of genes or 
markers informative for said condition 

wherein said first set of genes or markers within each of said first classes is unique to said 
class relative to other first classes. 

2. The method of claim 1 5 which further comprises additionally classifying into a 
plurality of second classes said samples or individuals in at least one of said first classes on 
the basis of a phenotypic or genotypic characteristic different that that used in said classifying 
step (a); and identifying within at least one of said second classes a second set of informative 
genes or markers, wherein said second set of informative genes or markers within each of 
said second classes is unique to said second class relative to other first and second classes. 

3 . A method of identifying a set of informative genes or markers for a condition 
comprising a plurality of phenotypic or genotypic characteristics, comprising: 

(a) classifying each of a plurality of samples or individuals on the basis of 
one or more phenotypic or genotypic characteristics into a plurality of first classes; 

(b) classifying at least one of said first classes into a plurality of second 
classes on the basis of phenotypic or genotypic characteristic different than that used 
in said classifying step (a); and 

(c) identifying within at least one of said first classes or said second 
classes a set of genes or markers informative for said condition, 

wherein said second set of genes or markers is unique to said class relative to other first and 
second classes. 
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4. A method of identifying a set of informative genes or markers for a condition 

comprising a plurality of phenotypic or genotypic characteristics, comprising: 

(a) selecting a first characteristic from said plurality of phenotypic or 
genotypic characteristics; 

(b) identifying at least two first condition classes differentiable by said 
first characteristic; 

(c) selecting a plurality of individuals classifiable into at least one of said 
first condition classes; and 

(d) identifying in samples derived from each of said plurality of 
individuals a set of genes or markers informative for said condition within said at lea.st 
one of said first condition classes. 

5. A method of classifying an individual with a condition as having a good 
prognosis or a poor prognosis, comprising: 

(a) classifying said individual into one of a plurality of patient classes, said 
patient classes being differentiated by one or more phenotypic, genotypic or clinical 
characteristics of said condition; 

(b) determining the level of expression of a plurality of genes or their 
encoded proteins in a cell sample taken from the individual relative to a control, said 
plurality of genes or their encoded proteins comprising genes or their encoded 
proteins informative for prognosis of the patient class into which said individual is 
classified; and 

(c) classifying said individual as having a good prognosis or a poor 
prognosis on the basis of said level of expression. 

6. The method of claim 5, wherein said condition is cancer, said good prognosis 
is the non-occurrence of metastases within five years of initial diagnosis, and said poor 
prognosis is the occurrence of metastases within five years of initial diagnosis. 
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7. The method of claim 5, wherein said control is the average level of expression 
of each of said plurality of genes or their encoded proteins across a plurality of samples 
derived from individuals identified as having a poor prognosis. 

8. The method of claim 7, in which said classifying step (c) is carried out by a 
method comprising comparing the level of expression of each of said plurality of genes or 
their encoded proteins to said average level of expression of each corresponding gene or its 
encoded protein in said control, and classifying said individual as having a poor prognosis if 
said level of expression correlates with said average level of expression of eacli of said genes 
or their encoded proteins in said control more strongly than would be expected by chance. 

9. The method of claim 5, wherein said control is the average level of expression 
of each of said plurality of genes or their encoded proteins across a plurality of samples 
derived from individuals identified as having a good prognosis. 

10. The method of claim 9, in which said classifying in step (c) is carried out by a 
method comprising comparing the level expression of each of said plurality of* genes or their 
encoded proteins to said average level of expression of each corresponding gene or its 
encoded protein in said control, and classifying said individual as having a good prognosis if 
said level of expression correlates with said average level of expression of eacli of said genes 
or their encoded proteins in said control more strongly than would be expected by chance. 

1 1 . The method of claim 5, wherein said plurality of patient classes comprises 
ER~, BRCAI individuals; ER~ sporadic individuals; ER+, ER/AGE high individuals; ER+, 
ER/AGE low, LN+ individuals; and ER+, ER/AGE low, LN _ individuals. 

12. A method of classifying a breast cancer patient as having a good prognosis or 
a poor prognosis comprising: 

(a) classifying said breast cancer patient as ER~ BRCAI; ETC, sporadic; 
ER+, ER/AGE high; ER+, ER/AGE low, LN+; or ER+, ER/AGE low, LN~; 

(b) determining the level of expression of a first plurality of genes in a cell 
sample taken from said breast cancer patient relative to a control, said first plurality of 
genes comprising two of the genes corresponding to the markers in Table 1 if said 
breast cancer patient is classified as ER~~, BRCAI; in Table 2 if said breast cancer 
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patient is classified as ER~ sporadic; in Table 3 if said breast cancer patient is 
classified as ER+, ER/AGE high; in Table 4 if said breast cancer patient is classified 
as ER+. ER/AGE low, LN+; or in Table 5 if said breast cancer patient is classified as 
ER+, ER/AGE low, LN~; and 

(c) classifying said breast cancer patient as having a good prognosis or a 
poor prognosis on the basis of the level of expression of said first plurality of genes, 

wherein said breast cancer patient is "ER/AGE high" if the ratio of the logio(ratio) of ER 
gene expression to age exceeds a predetermined value, and "ER/AGE low" if the ratio of the 
logio(ratio) of ER gene expression to age does not exceed said predetermined value. 

1 3 . The method of claim 1 2, wherein said control is the average level of 
expression of each of said plurality of genes in a plurality of samples derived from ER", 
BRCA1 individuals, if said breast cancer patient is ER", BRCA1; the average level of 
expression of each of said plurality of genes in a plurality of samples derived from ER"", 
sporadic individuals if said breast cancer patient is ER", sporadic; the average level of 
expression of each of said plurality of genes in a plurality of samples derived from ER+, 
ER/AGE high individuals, if said breast cancer patient is ER+, ER/AGE high; the average 
level of expression of each of said plurality of genes in a plurality of samples derived from 
ER+, ER7AGE low, LN+ individuals where said breast cancer patient is ER+, ER/AGE low, 
LN+; or the average level of expression of each of said plurality of genes in a plurality of 
samples derived from ER+, ER/AGE low, LN" individuals where said breast cancer patient is 
ER+, ER/AGE low, LN". 

14. The method of claim 13, wherein each of said individuals has a poor 
prognosis. 

1 5. The method of claim 13, wherein each of said individuals has a good 
prognosis. 

1 6. The method of claim 14, wherein said classifying step (c) is carried out by a 
method comprising comparing the level of expression of each of said plurality of genes or 
their encoded proteins in a sample from said breast cancer patient to said control, and 
classifying said breast cancer patient as having a poor prognosis if said level of expression 
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correlates with said average level of expression of the corresponding genes or their encoded 
proteins ixi said control more strongly than would toe expected by chance. 

1 7 . The method of claim 1 2, wherein said predetermined value of ER is calculate d 
as ER = 0.1(AGE - 42.5), wherein AGE is the age of said individual. 

1 8. The method of claim 12, wherein said individual is ER~ BRCA1, and said 
plurality of genes comprises two of the genes for which markers are listed in Table 1. 

1 9. The method of claim 12, wherein said individual is ER~, BRCA1, and said 
plurality of genes comprises all of the genes for wliich markers are listed in Table 1 . 

20. The method of claim 12, wherein s«aid individual is ER", sporadic, and said 
plurality of genes comprises two of the genes for xvhich markers are listed in Table 2. 

2 1 . The method of claim 12, wherein said individual is ER", sporadic, and said 
plurality of genes comprises all of the genes for wliich markers are listed in Table 2. 

22. The method of claim 12, wherein said individual is ER+, ER/AGE high, and 
said plurality of genes comprises two of the genes for which markers are listed in Table 3. 

23. The method of claim 12, wherein said individual is ER+, ER/AGE high, and 
said plurality of genes comprises all of the genes for which markers are listed in Table 3. 

24. The method of claim 12, wherein said individual is ER+, ER/AGE low, LN+-, 
and said plurality of genes comprises two of the genes for which markers are listed in Table 
4. 

25. The method of claim 12, wherein said individual is ER+, ER/AGE low, LN-4-, 
and said plurality of genes comprises all of the genes for which markers are listed in Table A. 

26. The method of claim 12, wherein said individual is ER+, ER/AGE low, LN~, 
and said plurality of genes comprises two of the genes for which markers are listed in Table 
4. 

27. The method of claim 12, wherein said individual is ER+, ER/AGE low, LN~, 
and said plurality of genes comprises all of the genes for which markers are listed in Table 4. 
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28. The method of claim 1 2, further comprising determining in said cell sample 
the level of expression, relative to a control, of a second plurality of genes for which markers 
are not found in Tables 1-5, wherein said second plurality of genes is informative for 
prognosis. 

29. A method for assigning an individual to one of a plurality of categories in a 
clinical trial, comprising: 

(a) classifying said individual as ER~ BRCA1, ERT, sporadic; ER+, 
ER/AGE high; ER+, ER/AGE low, LN+; or ER+, ER/AGE low, LIST; 

(b) determining for said individual the level of expression of at least two 
genes for which markers are listed in Table 1 if said individual is classified as ER~, 
BRCA1; Table 2 if said individual is classified as ER~ sporadic; Table 3 if said 
individual is classified as ER+, ER/AGE high; Table 4 if said individual is classified 
as ER+, ER/AGE low, LN+; or Table 5 if said individual is classified as ER+, 
ER/AGE low, LKT; 

(c) determining whether said individual has a pattern of expression of said 
at least two genes that correlates with a good prognosis or a poor prognosis; and 

(d) assigning said individual to one category in a clinical trial if said 
individual has a good prognosis, and assigning said individual to a second category in 
said clinical trial if said individual has a poor prognosis. 

30. The method of claim 29, wherein said individual is additionally assigned to a 
category in said clinical trial on the basis of the classification of said individual as determined 
in step (a). 

31. The method of claim 29, wherein said individual is additionally assigned to a 
category in said clinical trial on the basis of any other clinical, phenotypic or genotypic 
characteristic of breast cancer. 

32. The method of claim 29, further comprising determining in said cell sample 
the level of expression, relative to a control, of a second plurality of genes for which markers 
are not found in Tables 1-5, wherein said second plurality of genes is informative for 
prognosis of breast cancer, and determining from the expression of said second plurality of 
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genes, in addition to said first plurality of genes, whether said individual has a good 
prognosis or a poor prognosis. 

33 . A method of identifying a set of genes informative for a condition, said 
condition having a plurality of phenotypic or genotypic characteristics such that samples may 
be categorized by at least one of said phenotypic or genotypic characteristics into at least one 
characteristic class, said method comprising: 

(a) selecting a plurality of samples from individuals having said condition; 

(b) identifying a first set of genes informative for said characteristic class 
using said plurality of samples; 

(c) predicting the characteristic class of each of said plurality of samples; 

(d) discarding samples for which said characteristic class is incorrectly 
predicted; 

(e) repeating steps (c) and (d) at least once; and 

(f) identifying a second set of genes informative for said characteristic 
class using samples in said plurality of samples remaining after step (e). 

34. The method of claim 6, wherein said cancer is breast cancer. 

35. A method for assigning an individual to one of a plurality of categories in a 
clinical trial, comprising: 

(a) classifying the individual into one of a plurality of condition categories 
differentiated by at least one genotypic or phenotypic characteristic of the condition; 

(b) determining the level of expression, in a sample derived from said 
individual, of a plurality of genes informative for said condition category; 

(c) determining whether said level of expression of said plurality of genes 
indicates that the individual has a good prognosis or a poor prognosis; and 

(d) assigning the individual to a category in a clinical trial on the basis of 
prognosis. 
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36. A method for identifying one or more sets of informative genes or markiers for a 
condition in an organism, comprising: 

(a) subdividing a plurality of individuals or samples derived therefrom of serid 
organism subject to said condition into a plurality of classes based on one or more clinical, 
phenotypic or genotypic characteristics of said organism, wherein each said class consists of 
a plurality of individuals or samples derived therefrom of said organism each having said one 
or more clinical, phenotypic or genotypic characteristics specific for said class; and 

(b) attempting to identify for each of one or more of said plurality of classes a set of 
genes or markers informative for said condition in individuals in said class, 

wherein, if a set of genes or markers informative for said condition in individuals in said class 
is obtained for any of said one or more of said plurality of classes, said set of genes or 
markers is taken as a set of informative genes or markers for said condition in said organism. 

37. The method of claim 36, further comprising, for each of one or more of said 
classes in which a set of genes or markers informative for said condition in individuals in said 
class cannot be obtained, repeating said steps (a) and (b) on said plurality of individuals or 
samples derived therefrom in said class such that said plurality of individuals or samples 
derived therefrom in said class is subdivided into a plurality of additional classes based on 
one or more clinical, phenotypic or genotypic characteristics of said organism which are 
different from those used for defining said class, wherein, for each of said plurality of 
additional classes, if a set of genes or markers informative for said condition in individuals in 
said class is obtained, said set of genes or markers is taken as a set of informative genes or 
markers for said condition in said organism. 

38. A method for identifying one or more sets of informative genes or markers for a 
condition in an organism, comprising: 

(a) subdividing a plurality of individuals or samples derived therefrom of said 
organism subject to said condition into a plurality of classes based on one or more clinical, 
phenotypic or genotypic characteristics of said organism, wherein each said class consists of 
a plurality of individuals or samples derived therefrom of said organism each having said one 
or more clinical, phenotypic or genotypic characteristics specific for said class; 
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(b) attempting to identify for each of one or more of said plurality of classes a set of 
genes or markers informative for said condition in individuals in said class, wherein if a set of 
genes or markers informative for said condition in individuals in said class is identified for 
any of said one or more of said classes, said set of genes or markers is taken as a set of 
informative genes or markers for a condition in said organism; and 

(c) for each of one or more of said classes in which a set of genes or markers 
informative for said condition in individuals in said class cannot be obtained, repeating said 
steps (a) and (b) on said plurality of individuals or samples derived therefrom in sadd class 
such that said plurality of samples or individuals in said class is subdivided into a plurality of 
additional classes based on one or more clinical, phenotypic or genotypic characteristics of 
said organism which are different from those used those used for defining said class, wherein, 
for each of one or more of said plurality of additional classes, if a set of genes or markers 
informative for said condition in individuals in said class is obtained, said set of genes or 
markers is taken as a set of informative genes or markers for a condition in said organism. 

39. The method of claim 38, wherein said condition is a type of cancer, and wherein 
each of said sets of genes or markers is informative of prognosis of individuals in a 
corresponding class. 

40. The method of claim 39, wherein said condition is breast cancer, and wherein said 
one or more clinical, phenotypic or genotypic characteristics comprises age, ER le-vel, 
ER/AGE, BRAC1 status, and lymph node status. 

41 . The method of claim 39, further comprising generating a template profile 
comprising measurements of levels of genes or markers of said set for said class 
representative of levels of the genes or markers in a plurality of patients having a chosen 
prognosis level. 

42. A method for predicting a breast cancer patient as having a good prognosis or a 
poor prognosis, comprising: 

(a) classifying said breast cancer patient into one of the following classes: (al) ER , 
BRCAI; (a2) ER", sporadic; (a3) ER+, ER/AGE high; (a4) ER+, ER/AGE low, LN+; or (a5) 
ER+, ER/AGE low, LN~; 



108 



WO 2005/086891 



PCT/US2005/007894 



(b) determining a profile comprising measurements of a plurality of genes or markers 
in a cell sample taken from said breast cancer patient, said plurality of genes markers 
comprising at least two of the genes or markers corresponding to the markers in (bl) Table 1 
if said breast cancer patient is classified as ER~ BRCAI; (b2) Table 2 if said breast carxcer 
patient is classified as ER~ sporadic; (b3) Table 3 if said breast cancer patient is classified as 
ER+, ER/AGE high; (b4) Table 4 if said breast cancer patient is classified as ER+, ER/AGE 
low, LN+; or (b5) Table 5 if said breast cancer patient is classified as ER+, ER/AGE low, 
LN~; and 

(c) classifying said breast cancer patient as having a good prognosis or a poor 
prognosis based on said profile of said plurality of genes or markers, 

wherein ER + designates a high ER level and ER" designates a low ER level, wherein said 
ER/AGE is a metric of said ER level relative to the age of said patient, and wherein LN + 
designates a greater than 0 lymph nodes status in said patient and LN" designates a 0 lymph 
nodes status in said patient. 

43. The method of claim 42, wherein step (c) is carried out by a method comprising 
comparing said profile to a good prognosis template and/or a poor prognosis template, and 
wherein said patient is classified as having a good prognosis if said profile has a high 
similarity to a good prognosis template or has a low similarity to a poor prognosis template or 
as having a poor prognosis if said profile has a low similarity to a good prognosis template or 
has a high similarity to a poor prognosis template, said good prognosis template comprising 
measurements of said plurality of genes or markers representative of levels of said genes or 
markers in a plurality of good outcome patients and said poor prognosis template comprising 
measurements of said plurality of genes or markers representative of levels of said genes or 
markers in a plurality of poor outcome patients, wherein a good outcome patient is a breast 
cancer patient who has non-reoccurrence of metastases within a first period of time after 
initial diagnosis and a poor outcome patient is a patient who has reoccurrence of metastases 
within a second period of time after initial diagnosis. 

44. The method of claim 43 „ further comprising determining said profile, said ER 
level, said LN status, and/or, said ER/AGE. 

45. The method of claim 44, wherein said profile is an expression profile comprising 
measurements of a plurality of transcripts in a sample derived from said patient, wherein said 
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good prognosis template comprises measurements of said plurality of transcripts 
representative of expression levels of said transcripts in said plurality of good outcome 
patients, and wherein said poor prognosis template comprises measurements of said plurality 
of transcripts representative of expression levels of said transcripts in said plurality of poor 
outcome patients. 

46. The method of claim 45, wherein said expression profile is a differential 
expression profile comprising differential measurements of said plurality of transcripts in said 
sample derived from said patient versus measurements of said plurality of transcripts in a 
control sample. 

47. The method of claim 43, wherein said profile comprises measurements of a 
plurality of protein species in a sample derived from said patient, wherein said good 
prognosis template comprises measurements of said plurality of protein species representative 
of levels of said protein species in said plurality of good outcome patients, and wherein said 
poor prognosis template comprises measurements of said plurality of protein species 
representative of levels of said protein species in said plurality of poor outcome patients. 

48. The method of claim 46, wherein measurement of each said transcript in said good 
prognosis template is an average of expression levels of said transcript in said plurality of 
good outcome patients. 

49. The method of claim 48, wherein similarity of said expression profile to said good 
prognosis template is represented by a correlation coefficient between said expression profile 
and said good prognosis template, wherein said correlation coefficient greater than a 
correlation threshold indicates a high similarity and said correlation coefficient equal to or 
less than said correlation threshold indicates a low similarity. 

50. The method of claim 48, wherein similarity of said expression profile to said good 
prognosis template is represented by a distance between said cellular constituent profile and 
said good prognosis template, wherein said distance less than a given value indicates a high 
similarity and said distance equal to or greater than said given value indicates a low 
similarity. 

51. The method of claim 49, wherein said correlation threshold is 0.5. 
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52. The method of claim 51 5 wherein said ER level is determined by measuring an 
expression level of a gene encoding said estrogen receptor in said patient relative to 
expression level of said gene in said control sample, and wherein said ER level is classified 
as ER + if loglO(ratio) of said expression level is greater than -0.65, and wherein said ER level 
is classified as ER" if loglO(ratio) of said expression level is equal to or less than -0.65. 

53. The method of claim 52, wherein said gene encoding said estrogen receptor is the 
estrogen receptor a gene. 

54. The method of claim 53, wherein said ER/AGE is classified as high if said ER 
level is greater than c(AGE - d), and wherein said ER/AGE is classified as low if said ER 
level is equal to or less than c { AGE - d), wherein c is a coefficient, AGE is the age of said 
patient, and d is an age threshold. 

55. The method of claim 54, wherein said estrogen receptor level is measured by a 
polynucleotide probe that detects a transcript corresponding to the gene having accession 
number NM__000125, wherein said control sample is a pool of breast cancer cells of different 
patients, and wherein c = 0.1 and d- 42.5. 

56. The method of claim 55, wherein said control sample is generated by pooling 
together cDNAs of said plurality of transcripts from a plurality of breast cancer patients. 

57. The method of claim 55, wherein said control sample is generated by pooling 
together synthesized cDNAs of said plurality of transcripts and said transcript of said gene 
encoding said estrogen receptor. 

58. The method of claim 42, wherein said individual is ER", BRCA1, and said 
plurality of genes comprises at least two of the genes for which markers are listed in Table 1 . 

59. The method of claim 42, wherein said individual is ER", BRCA1, and said 
plurality of genes comprises all of the genes for which markers are listed in Table 1 . 

60. The method of claim 42, wherein said individual is ER", sporadic, and said 
plurality of genes comprises at least two of the genes for which markers are listed in Table 2. 

61. The method of claim 42, wherein said individual is ER", sporadic, and said 
plurality of genes comprises all of the genes for which markers are listed in Table 2. 
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62. The method of claim 42, wherein said individual is ER+, ER/AGE high, and said 
plurality of genes comprises at least two of the genes for which markers are listed in Table 3 . 

63. The method of claim 42, wherein said individual is ER+, ER/AGE high, and said 
plurality of genes comprises all of the genes for which markers are listed in Table 3. 

64. The method of claim 42, wherein said individual is ER+, ER/AGE low, LN+, and 
said plurality of genes comprises at least two of the genes for which markers are listed in 
Table 4. 

65. The method of claim 42, wherein said individual is ER+, ER/AGE low, LN+, and 
said plurality of genes comprises all of the genes for which markers are listed in Table 4. 

66. The method of claim 42, wherein said individual is ER+, ER/AGE low, LIST, and 
said plurality of genes comprises at least two of the genes for which markers are listed in 
Table 4. 

67. The method of claim 42, wherein said individual is ER+, ER/AGE low, LN~, and 
said plurality of genes comprises all of the genes for which markers are listed in Table 4. 

68. The method of claim 42, wherein said profile further comprises one or more genes 
for which markers are not found in Tables 1-5, wherein said one or more genes are 
informative for prognosis. 

69. A method for assigning an individual to one of a plurality of categories in a 
clinical trial, comprising assigning said, individual to one category in a clinical trial if said 
individual has a good prognosis as determined by the method of any one of claims 7-33, and 
assigning said individual to a second category in said clinical trial if said individual has a 
poor prognosis as determined by the method of any one of claims 7-33. 

70. The method of claim 69, wherein said individual is additionally assigned to a 
category in said clinical trial on the basis of the classification of said individual as determined 
in step (a). 

71. The method of claim 69, wherein said individual is additionally assigned to a 
category in said clinical trial on the basis of one or more other clinical, phenotypic or 
genotypic characteristic of breast cancer. 
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72. The method of claim 69, further comprising determining in said cell sample the 
levels of expression of said one or more genes for which markers are not found in Tables 1 -5, 
and determining from said expression levels of said one or more genes, whether said 
individual has a good prognosis or a poor prognosis. 

73. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in any one of Tables 
1-5. 

74. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in Table 1 . 

75. The microarray of claim 74, wherein said plurality of polynucleotide probes 
comprises a probe complementary and hybridizable to each of the genes listed in Table 1. 

76. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in Table 2. 

77. The microarray of claim 76, wherein said plurality of polynucleotide probes 
comprises a probe complementary and hybridizable to a sequence in each of the genes listed 
in Table 2. 

78. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in Table 3. 

79. The microarray of claim 78, wherein said plurality of polynucleotide probes 
comprises a probe complementary and hybridizable to a sequence in each of the genes listed 
in Table3. 

80. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in Table 4. 

81. The microarray of claim 80, wherein said plurality of polynucleotide probes 
comprises a probe complementary and hybridizable to a sequence in each of the genes listed 
in Table 4. 

82. A microarray comprising a plurality of polynucleotide probes each 
complementary and hybridizable to a sequence in a different gene listed in Table 5. 
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83. The microarray of claim 82, wherein said plurality of polynucleotide probes 
comprises a probe complementary and liybridizable to a sequence in each of the genes listed 
in Table 5. 

84. The microarray of any of claims 73-83, wherein said plurality of polynucleotide 
probes constitutes at least 50% of the probes on said microarray. 

85. The microarray of any of claims 73-83, wherein said plurality of polynucleotide 
probes constitutes at least 90% of the probes on said microarray. 

86. The microarray of claim 73, wherein said plurality of polynucleotide probes 
comprises probes complementary and kybridizable to at least 75% of the genes listed in Table 
1, Table 2, Table 3, Table 4, or Table 5, wherein said plurality of polynucleotide probes, in 
total, constitutes at least 50% of the probes on said microarray. 

87. A kit comprising the microarray of any one of claims 73-83 in a sealed container. 
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Sequence Listing 
<110> Rosetta Inpharmatics , LLC 

<12 0> Classification of Breast Cancer Patients Using a Combination 
of Clinical Criteria and Informative Genesets 



<130> 9301-251-228 

<140> 
<141> 

<150> 60/650,401 
<151> 2005-02-04 

<150> 60/604,076 
<151> 2004-08-24 

<150> 60/550,810 
<151> 2004-03-05 

<160> 366 

<210> 1 

<211> 4946 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AB032969 
<400> 1 

cagcctcagc ccccagatga agatgggga.t cacagtgaca aagaagatga acagcctcaa 60 

gtggtggttt taaaaaaggg agacctgtca gttgaagaag tcatgaaaat taaagcagaa 12 0 

ataaaggctg ccaaagcaga tgaagaacoa actccagccg atggaagaat catatatcga 18 0 

aaaccagtca agcatccctc agatgaaaaa tattcaggtt taacagcaag ctcaaaaaag 240 

aagaagccaa atgaagatga agtaaatcag gactcggtca aaaagaactc acaaaaacaa 30 0 

attaaaaata gtagcctcct ttcttttgac aacgaagatg aaaatgagta agtgtaaata 360 

ttttgaattt agtctacttt gaaagtatat ggagtgttca ttaaaatcac attttttcct 42 0 

attataaaga tactacaagt tctttataga aagtttagga aatagagaaa aaaatttaat 480 

aaactacatc tattcatcaa tacccctctg acttaaaatg ccaactctat agaaattagc 540 

tagtattaac attttgttat ttcccttgtg tggttgtata tatatgtaaa ttatattttt 600 

aagcaaaata cattttttgt gtgtaaacaa aattttataa atacaactgt attgcaaatg 66 0 

ttctttgtcc tgcttctcac ttgacattgc attatgagta ttcttccagg tcagtaaatt 72 0 

tcaaaaacct gacattaata gctacagata atttcataaa catctcattg tatctttttc 780 

attagcaata gctccacttt gggtggggg-a gatgataatg tgccttgtta aaaatacctc 840 

cccaactcct gctaagggtg gccatgagac tcagctctgg caagttaaga aatacaggtg 90 0 

gaattctgct tgataaagct gctgggtttt ttgttacaaa aggacagact tggcaaacat 960 

gagcctttgc tcttatcttt tcatcctact tggagtgcag agataaaacc tgagtaccag 102 O 

agccactttt aggcataagg aaggcagcca tgtgctttgg gtcatgttag taaaaagact 108 0 

cagagcttgg ctccttgctg acatgcctgg aggagctgct acaccagctt ggattgctga 114 0 

cctctgactt cttggtagtg agaagaataa acactgtgct taattaggcc ttggtcaggt 12 0 O 

ttcttttata tgcagccaaa tgcagtccta agtaatacaa taaataactg gtcaaactgt 12 6 0 

tactggtgga gggtgtccag gttcttgg-ca ttttggacaa ataattgaac aaaacgcaca 132 O 

aagcaatgaa tatcctctag aggtttgcca ttggttactt ggcgtacacc ctgtgtaaat 138 0 

gaagtagtgg cccgtgacct gtctgattgg tgcagaaagt gaccaatcag aggctgaagt 144 0 

gaagttacaa agttatactc ctgtgtaa.a.t gaggacttgg cctatgacca gtctgattgg 150 0 

ttgcaggagg ggaccaatca gaggcacttt catttttcat ctgcaatgca gaaaaggcaa 156 0 

ggggattgca aagggagtag cctctgatcc ttttgttact taggtatgga gaggtggggt 162 0 

tttccttttg attcagttct aggaagtcaa tgtgaatcag ccttaggttc cctgtctcca 168 0 
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gaccctattc tcctgcctca ttttccccct gagagacgtg atcctcgtaa atctttatgg 1740 

gaggctgaga gactgagggt ctttcttctg taactgcttc atgctaactt gggacacagt 1800 

ccctacctat tggagatcac gtaactctca ccctgctttg tctaggggag acagggtagc 1860 

ttcttgatgg ccggtggtgt cttctcctga aactggctag aaatcttgtc acatgatcat 192 0 

ctaacttggt ggtctctagg caaaaggaaa tggatttggt taaaagattt aacagatatg 1980 

gtccaaaaac caaggcaaat ataatcatta ataatgggct ggccaaggga gggagccatg 2 040 

aaacccaact tagtgccctt taggtgcccc agctgttgtc atattttaga ggcccagtca 2 100 

gctagttttc aggtggtgtc ccttactaat cctgattggt tgacatcaaa acagcattct 2 160 

tcttctagga aaatacataa gccacctgtt tcagcagtta ggagatctag tccccttcga 2 22 0 

ttttgcaaag cgaccactgc caaggagcct atccgaattt gtaaggtgac aatactttga 2 280 

gcaatgttat ccaggctttc cataaaatcc ttggacaagc gttggtaata ggatagggaa 2 3 40 

gttgcaatcc cgctaactcc cattcctacc tctgctgtta ttcctagccg ttgtgtctgg 2 400 

tggttgcagt taaaggtata atgagggatt ggttgttggg agctatatta atttagggac 2 460 

atacaatatt tctgtctcca gtctaccact tccaccaaag acaaatcaca gcagaaccga 2 52 0 

cctaacttca aaataaactg cagtcccata tactgggcct gattacccac acaaagtgca 2 5 80 

acaagaatca ttgtccatat agactctcct agattggctt tgctagaaca tttcacaagg 2 640 

ccatttcagt caaagtcctg agaaagtaac cggtttcaat tgtgccctat tacaaaagaa 2 7 00 

aacgtggtta ttaactttat acagacaaat gccatgaatt aagaatattc ataaatagtt 2 7 60 

tacaaattct ggagaaatta gaatactcaa tacacttaaa gtgtatttca aggctataaa 2 82 0 

tagctcaaaa taaaaagatt attcagactc tgaaaaaaca aaaagaagta gcaatatttc 2 880 

aaacaacaaa agccatacaa attatttcag tcttccatta gttcatttca gtccatgtaa 2940 

tcaactcctg ctctacttca tattcatctt tatgaacaca tcagcctttc aattagtgcc 3 000 

ttggaagttt tctgtctaat ccaatggcac actctccaaa gttaccagaa acctgcattc 3 06 0 

aagagttctt ttcatgaact ccaaagaagt aagccttgga ctgtagctga ttataagtca 3120 

cttttttttt ttgagaagga tcaaagcaaa acatcaatta tggatgacaa aagtcttaag 3180 

acagccataa agacacagtt gacaaatgtg gctatttctg tggcttacaa caatttaaca 3 240 

taatcattac aacatatatt aagacatatc agaattttag aactctcata caatcctgga 33 00 

acacatatta acaacaaatc tctatcagta taacccaaag gaagctaaac accacctcac 33 6 0 

acttgacaat gtttcctgta taattcaaac attacaaata agcctaatat aagcctaata 3 42 0 

tgtcactctt gaacttcagg aagcctaata tccaaaaagt tagtttaagg tcaaaagttt 3 480 

ttgaattaac ttttttccat tagtatgcjtc atatctttct tactaatttg taagttatgt 3 540 

aatttatcaa tttttttttg ttgttctcjtt tcccaacctc tatgtcagat aaagaatcac 3 600 

ccaggccaga cacagtggct catgcttcrtg gtcccagcac tttgggaagc caaggtggga 3 6 60 

gaattgcttg aagccaggaa tctgagccca gcctgggcga caaagcaata cccctatctc 372 0 

tacaaaaaat aaaaaatagc caggtgtcjgc gacacacacc tgtggtccca gctgctcggg 3 7 80 

aggctgagcg ggaggatggc ttgggcccag gggttcaacg ctgcagtgag ctgtgattgc 3 840 

gccactgcac tccagcctgg gcaacagagt aagaactgtc tcaaaaaaaa taaaaaatag 3 9 00 

aaataaattt taaaaaaaga attacccata ttctctttgt ttttgtttat tcacattaac 3960 

ctttattcta tctggaattt atttgagtat acttttttct caaataatca attgtcctag 402 0 

aaccatgtgt ttctcattta tttgaaacjgc catctagtga gagatttctc caaatgttgg 4080 

ggtagggaag ggaggggaag cactttaaag tctgagcctt tagaggtgat tcctcaagac 4140 

cctgcttaat cctaacaatt ttcctcatta gtaaaagtca gcccaaactg ggggcttgtt 42 00 

aagatcctta ccagccacat ccatctgaaa ttatgaattt caaagtatct tacaaatttg 42 60 

gtgccacatt atctttttta agtttgtttt gttttgtttt tttgagacag agtctcgctc 432 0 

tgtcacccgg gctggagtgc agtggcgcga tctcagctca ctgcaagctc cgcctcctgg 43 80 

gttcacacca ttctcttgcc tcggcctccc aagtatctgg gactgcagtc gcccgccacc 4440 

acgcccggct aatttttttg tatttttaat agagacgggg tttcactttg ttagccagga 45 00 

tggtctcaat ctcctgacct catgatccac ctgcctcggc ctcccaaagt gctgggatta 4560 

caggcaggag ccaccgcgcc tgggcctttt tttaagtttt aagtacctat aaagaacact 462 0 

gaaaggtgat gtgtgtggat gagctaggaa gacctgaaat aggctctctc taaattaatc 46 80 

aaattaatcc tgaagccatt ctgcaatact gtctttaatg tatactcact tgttatagaa 4740 

gccagggttt tttcccctaa tttgtatcat tgctatatgt gttattgtac caaactacac 4800 

tgttttaatt gctgtaaatt ttaatatgtc ttagtatctg ggtgtgggaa tcttgaaagc 4860 
atggagtttg tgttattcac cactgtattc tcaaatatca gaagagtatc tggcctacta 492 0 
agtgcacaat aaacatagtt aaaatg 4946 



<210> 2 

<211> 60 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> AB032969 
<400> 2 

taatcctgaa gccattctgc aatactgtct ttaatgtata ctcacttgtt atagaagcca 60 

<210> 3 

<211> 1007 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> AF005487 



<400> 3 

gaatacagaa tgtgggcaaa ctcgcttctg 
atcactccct ggtgccgggc tccttgaggt 
tgaggaatgc agtcttgcaa gcctgctctg 
gagcaagcac gtttcttgga gcaggctaag 
cgggtgcagt acctgaacag atacatccat 
aacgtggagg agttccaggc agttacggaa 
agccagaagg gcatcccgga ggagaagcgg 
tacggggttt tttgagagct tcacagtgca 
tcctgcaaag acccagcccc tgcatcaccg 
ctatccaggc agcattaaag tcacjg-tggtt 
ggtctccata ggcctgatcc agaatggaga 
aacagttcct cggagtggag aggtttacac 
ccctctcaca gtggaatgga gtacacggac 
agtcgggggc tttgtgctgg gcctgctctt 
gaatcagaaa ggacactctg gacttcagcc 
gaagtgcaca tgaccacatt caaggaagaa 
ctttcccact tggctcttat tcttccacaa 



tgccggccgc cagaaggttt gctgagggca 6 0 

tatgcactgg gacatctaga gcctattgtt 12 0 

gatcaagcca cagactgaaa cacccccgaa 180 

tgtgagtgtc atatcttcaa tgggatgaag 2 40 

aaacgggagg agaacctgcg cttcgacagc 3 00 

ctggggcggc ctgtcgcaga gaactggaac 3 60 

gacaagatgg acgactactg cagatacaat 42 0 

gccgcgagtc catcctaagg tgactgtgta 4 80 

caaccccctg gtcggctctg tgagtggttt 540 

ccagaatggt caggaagaga aggctgcggt 6 00 

ttggaccttc cagaccctgg tgatgctgga 6 60 

ctgccaagtg gagcatccaa gcgtgacgag 72 0 

tgaatctgca cagagcaaga tgctgagtgg 7 80 

ccttgggaca gggctgttca tctacttcag 84 0 

aacaggactc ctgcgctgga ctcctgagct 9 00 

ccttctgcca cagctttgca ggatgaaaag 9 60 
gagctctctc aggacca 1007 



<210> 4 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> AF005487 



<400> 4 

tttgcaggat gaaaagcttt cccacttggc tcttattctt ccacaagagc tctctcagga 60 



<210> 5 

<211> 3200 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> AF026941 



<400> 5 

caggaagggc catgaagatt aataaagatt 
aataactcaa agaattactg ttgaataaat 
gcggatgcac acaaatgaaa ccctcacttc 
agagggacct gcaggaagcg gtagagaaaa 
aatttcagga tctttgtcac tgacagcagc 
aagaacaagg ctgagcgcga tgactcacgc 
caggcagact gcttgagccc aggagttcca 



tggactcagg gcaaatattt acttagtagc 6 0 

aagccaatta agcagccaat cacgtactat 12 0 

aacctgaaga cattcgcaca tgagttacgt 180 

cataaggctt atgcgtttaa tttccacacc 240 

actaagactt gttaacttta tatagttaag 3 00 

ctgtaagcct agaactttgg gaggccaaag 3 60 

gaccagcctg ggcaacatgg caacacccca 42 0 
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tctctacaaa 
actcgggagg 
gagatctcgc 
aataaataca 
tctcatctgg 
gctgggaagt 
ccgctgttct 
cagctggtcc 
accaccccaa 
tgtttccaca 
cttttgctta 
caagaccggg 
cccagcgtga 
ggtgagtatt 
cttattggcc 
tggtgtaggg 
gaagaggaca 
tgcctcttaa 
gttattggtg 
gtgcctgaat 
tttctgaact 
gaagaagcta 
tacatatgga 
tcaacacacc 
ggtatttccc 
gtcactgaac 
ttacttcatt 
tttagctaaa 
ccctatacac 
gatatctgtg 
tttctatatt 
gtattttttg 
gcctcaatta 
gtcttgtact 
atatgataat 
gcgtggtgca 
tacctgttgt 
ttgatgatgg 
cacatgaggc 
ttctgagaag 
tgggcaagga 
catgtcacca 
ggtcagttta 
tagatcaggg 
ttagtatctt 
gaaaattttt 
aaaaaaaaaa 



aaaatacaag 
cagaggcagg 
cactgcactc 
taaatataag 
ctctgctcca 
tcttgagtgt 
gctggctgag 
tgagagggcc 
ccagcgtcaa 
cagccaaaac 
aggaagctgg 
gagaatacct 
gcatcgtgag 
tggacattct 
gtggccaagg 
attatagaat 
tgacggaaca 
ttgaaggtga 
atgaagaatt 
ctaaccagaa 
gtagaaaggg 
taaaattcag 
gtaaggctga 
agtgggaaaa 
agtggctgaa 
acacgaataa 
ggcttataac 
aagaaggaat 
aggacctgac 
ggggcaatat 
tccattttga 
ccaaatatcc 
tttctgctgg 
ttttactgtg 
cctaccaatt 
gcccagccca 
gtccctttct 
agttgacatg 
tgtcaagcaa 
tgacgttaca 
agaatgtgag 
tatcatattt 
agaagagtca 
aacaaaatcc 
attgcctaaa 
gttgtccagg 
aaaaaggcca 



aatcagctgg 
aggattgctt 
cagtctggac 
gaaaaaaata 
ggcatctgcc 
gtt caggcaa 
ggcaaccttc 
agatgagacc 
ctatcacttc 
atcctttgtg 
tatggagaag 
ggg-caagttg 
caatggaagc 
cgctatctcc 
aaagaagaac 
ccctttcaag 
gatcaaagca 
gaattgtgga 
tgaaagattc 
gatgaaagac 
acggaaggac 
tgcjatttgat 
tctgaagctg 
ctcctagagt 
aacctgattt 
cttggatagc 
cttgttgtta 
acacacagga 
atttagctca 
ttaatttgga 
aactatttct 
agataaccag 
ttataatgct 
atgtacagaa 
ttcaagaagt 
cg-cjtgcctgt 
cttcaaagat 
gaggcagtgc 
aagaatagga 
ctgggttggc 
caagagtaga 
ttgaatgaac 
gctcagagaa 
tctccttgtg 
aaaaaatttc 
caaataaaag 
3 200 



gcgtggtgat 
gaacccggga 
gacagagtga 
aagctgcttt 
acaatgtggg 
cctctgagct 
tggctgctag 
aaagaggagg 
actcgccagt 
ctgccccttg 
atcaactttt 
gtgaggttct 
ctgatccggg 
tgtgacagct 
catgtggaaa 
ataaattctg 
ctaaaccctg 
gaagatgctc 
ttggagcgcc 
tcctacctta 



ccttccaagt 
gaaaagatgt 
gattggtaga 
aactgccatt 
tctgctgcac 
aaatcctgag 
ttgaaacagc 
ataatgaccc 
atgatgcgtt 
tttgattttt 
tgttccaggt 
ttttcacatc 
tttttttttt 
atagtcaaca 
ctctagaaag 
tccatgaatg 
ccctgagcaa 
ttgcattgct 
gtgtagttga 
ataagatatc 
gagagtgcct 
tctgagtcag 
agcaagcata 
gaaatatccc 
ttatcattgt 
gtcattttaa 



gtgttcctgt 
ggcagaggtt 
gactcagtct 
ctcctcttcc 
tgcttacacc 
ctctgtggag 
ctaccaagag 
aagaggaccc 
gcaactacaa 
aggaagcaaa 
caggtggaga 
gcaaagtaga 
agaggtggtt 
ttgacgagga 
accttcaaaa 
tcattaatcg 
tccgctggaa 
taagagaagc 
acaaagaagt 
ttctggatga 
ccatcctgga 
ttctgaagcg 
gcggaaagtg 
gtctgcaata 
gtggcatctg 
acaatggaaa 
acttctgttt 
caaaaatgct 
tgtaagaaat 
taaaacaatg 
ttgttcattt 
tgagacatta 
tttgctttta 
gatgtttcca 
agataacaca 
ctggctacct 
aacaaagata 
ttgttcgcct 
gtagctggtt 
ctaaaatcac 
ggatttcatg 
ttgaaatagg 
agggaaaatg 
atgcagtttg 
ttcaaaaaag 
tttaaaaaaa 



aatctcagct 
gtagttagcc 
caaataaata 
tcctctttgg 
tgctgctttt 
gagcctggtc 
gagaaagcag 
tcctctgccc 
atgcggcttc 
gagaggattg 
gccatttctt 
gttgcggctg 
ccagaattat 
agtcaatgtc 
gctgaggagg 
tttcaacgtg 
agtgttccag 
agaaagattt 
gtcctgcttg 
atatatgcgc 
tgttggtgta 
aggaggaaaa 
gaacgagact 
ctatcccgtt 
attacctgtg 
accattaact 
ttgagtttgt 
tagataaggc 
aagctctagt 
tttactgcga 
gacagagtca 
caaagtatct 
tgccattgca 
agaacatatg 
tggaaagacg 
atgtgtgtgg 
cgctttccat 
atcatctggc 
ggccctacat 
gctggaacct 
tcagtgaagc 
gtaccatcta 
tcacgtaaac 
ttgatacaac 
caaaatcatg 
aaaaaaaaaa 



480 

540 

600 

660 

720 

780 

840 

300 

960 

1.020 

1_080 

3.140 

1.200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 



<210> 6 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF026941 
<400> 6 

atttttgaat gaactctgag tcagttgaaa tagggtacca tctaggtcag tttaagaaga bu 



<210> 7 
<211> 1799 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF035284 
<400> 7 

gcttgaaccg gggaggtgga ggttgcagtg agctgagatc acgccattgt actccagcct SO 

gggcgacaga gcaagactcc atttcaaaaa aaaaaaaaaa aaaaaaaatc cactcatata 120 

aaaggtgagc tcagctcact ggtccatttc tcagtggctt ctccatcctc atttgcaaac 1.80 

ctcagaggga taaggcagtt gaacctgatg agcaagaatt ataacagcaa ggaaacatta 2 40 

atgcttagaa ttctgagatc cagcacaact cagtctgtgg gagctcagct cgctgcccag 3 00 

ggataggtat gacctatgtc tgccttaggc tgctgggaga tgccattctc cagtttcaga 3 60 

agcaggcagg gcaaaggtca agsictgtggt attggggtct tttggctctg aaggatcctg <=X2 0 

gaaccactga ttttggttta ttccctccag ggtctaaaga gaacaagagg tgctagctct 4=80 

taccaaaaca gatggtagag ageigttgctg gctatttaaa aagctctttc atcttttaat 540 

tcacctcttc ttttcacctc tttaaccact cctcaggaac agaacacttc taggactggg 6 00 

ggtcttttag ctccataagc aagtgagcag atgggacaag ttagtctttt ctccctagaa 6 60 

acaaagggga tgcccagtgg tttccctttg cttcccaacc taaaatttca agtttaataa 720 

aatagcaatt agcagaagtg accaaattgg gagataatta tcagtcatga ggaaagacac 7 8 0 

agatttcggt cataaagaat gtaagggcta taagtagaaa ctttctataa cctaaatgat 840 

gttatagaat tatttttgag caggagcaga aagattaaat atgatcactt catacttcta 9 00 

aatcagaaat aggaagatta aaaccacaga acagtttgtg atttctattg ctggtagcta 9 60 

ggtatcttac tctgtccact cttgttcaag tatctaactc ttctggaaac caaataggct 1020 

ttagaagaga ttatcctata ttcctatcag tataatacta aaatgtaact ttttaatcat 1080 

ctggttttta aaagataaac agtttagccc atctctccag agagcaaaca taggaatatg 1140 

actcaggagc ctcctagggc ttatcatcag ccctcacacc cgcttccccc tccaacccac 12 00 

agcctttgct tccaggtggc aggattacta ctttgcctct tcagcagcat ctactctagg 12 60 

catattgatc attttagaca ctgggagaag agaacctcaa actaggagga aaagacagag 132 0 

cctccactta gttttgggag gg^atggcag acagtcaagg agatgagcgt cctaaggcat 13 80 

gttgggatag ggtcagatgc accacccatg gagaggtttg tcaacacaaa gacatggaag 1440 

gttagaggtt tgtcaacaaa aagacatgga aggttaggtt tgtcaacaca aagacatgga 15 00 

agattagagg tttgtcaaca caaagataca ggaagaatgg gctgcagaag atttagatgt 15 6 0 

tttccatttg ggcacatttt acttagctgg agaactaggt ttaaaacagc ctgggtagga 162 0 

aaattagaag caagctggat gcagtggctc atgcctgtaa tcccaacact tttgggaggt 16 80 

ccaggcagga ggatcacttg ggcccaggag gtcaagcctg cagcgagctg agatcacacc 1740 

actgcactcc agcctggggt gatagaacaa gaccctgtct caaaaaaaaa aaaaaaaaa 17 99 

<210> 8 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF035284 
<400> 8 

caaaaagaca tggaaggtta ggrtttgtcaa cacaaagaca tggaagatta gaggtttgtc 6 0 

<210> 9 

<211> 1380 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF052162 
<400> 9 

gtcaaaggat atttatttat ag-gccttttt ttttttaata tagaatctga ggctgtttgg 60 

gctttgactt aaatttccat caggcctctc tccagcaggt aatccctctc cttccgctgg 12 0 

gtcccctggg gaggtgtgaa ctcaagggcc tagccccaaa acactttttc tgcttttctt 180 
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aatccttttc 
cataacgtaa 
tgagcagggc 
tcaggccagg 
actggcgtca 
cttttgcttg 
tgacagccgt 
gcacatccca 
caggccctgg 
tcatttttta 
agtttgcttc 
tgaattgatt 
cgggaaaaaa 
acatttaaaa 
ggacactgcc 
gcgtgttgga 
ataattggag 
aaggaggtaa 
ggtctgcaga 
ctctgataaa 



cagtcccctc 

tccatttcac 

ccgagctgga 

tccctgctgg 

cgcagccacc 

tggaaggaga 

gcggacacca 

ctgagagcgg 

atggggtatt 

tgacagaatt 

ctaaagcaaa 

tttatatttg 

tgtttactaa 

acaaccttca 

agcaggacag 

cacagaagaa 

aatgcgtaaa 

ataaactaag 

tgaagggttt 

cagactttga 



tttttttata 
tgtagcctaa 
agcgcaaggc 
aagcggctgc 
cgagtatggc 
cgctgggctg 
ctcctctctg 
gggtcctgcc 
ttggtgtcac 
agaaacccat 
ggctccgggt 
gggcatctct 
cttcctcaga 
gctccgtttc 
acagggatgg 
atcctagtgc 
gattcatttt 
atattccgag 
tttgaatgaa 
atggaaaaaa 



aacgttggca 
actccagtcc 
agccgccgcc 
atcttcctgt 
tgaccttcct 
tgcggtgcgg 
cagcactgcc 
ccatcttaga 
ctgaagtccc 
ccttcaagca 
ttgtttggaa 
atagaaagtg 
gattcgtgat 
aatcaatcac 
agtaaaccga 
agcctttggt 
tcaaggagaa 
ggaagggacc 
atgccactgt 
aaaaaaaaaa 



gtttgatgtt 
gaggttggat 
gtgccgctcc 
cagccctggt 
gcagagagag 
agggtgatga 
tcccagcgcc 
gtcaaaggca 
tctgacatca 
caataatcat 
aatttttttg 
accaccaagg 
acgcgtttct 
ctcgacttgt 
agtcaatttc 
agctaacagt 
gagcctgcaa 
caggccacct 
gcattttcag 
aaaaaaaaaa 



tctgtttcgg 
attgttcaaa 
tcccttgccc 
ttccatggtg 
gagccgcagt 
ggatgtctgg 
agggtcgcgg 
gaggggcttc 
ccttgtttca 
cacagacttg 
atttctgaaa 
ccagtaagta 
ccactgacag 
tttttagcat 
agggctcttg 
cactgatttt 
atggccaatg 
cccttccgca 
aaaaaaaaat 
aaaaaaaaaa 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 



<210> 10 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF052162 



<400> 10 

cagtaagtac gggaaaaaat gtttactaac ttcctcagag attcgtgata cgcgtttctc 



60 



<210> 11 

<211> 1722 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AF055033 



<400> 11 

ggggaaaaga 

ttttcattac 

tcgaagagga 

ttgtttttgt 

aacgcctctt 

ggtgtatttt 

caacgccatc 

tttgtggctt 

ccgctcactg 

tcttgcaact 

aagtgtctgg 

aaaaaaatct 

accccgaggt 

gctggccgcc 

cgacgagaaa 

gccgggctgc 

caccgagcgc 

gcacgccctg 

caagatcgag 



gctaggaaag- 
ccctcctccg 
ggtggcaaag- 
tttttaattt 
acctgttctg- 
agattttaacj 
tccactgcat 
tttttcccct 
cgtgcacctg- 
gggacctgct 
gggagggcao 
ccgggggccc 
aaagggggcg- 
tatgcggggc 
gccctctcca 
ggctgctgcsi 
tgcgcccagg* 
ctgcacggcc 
agagactccc 



agctgcaaag 
ttttcaccct 
tgggagaaaa 
cttgatttca 
cggcgccgcg 
caaaaatttt 
ccgatctcat 
ataattctga 
gcgctgccct 
cgcaggcacc 
ctgctctacc 
tcttggcccc 
actaagagaa 
cggcccagag 
tgtgcccccc 
tgacctgcgc 
ggctgcgctg 
gcggggtttg 
gtgagcacga 



cagtgtgggc 
tctccggact 
gaggtgttag 
acattttctc 
caccgctggc 
aaagataaat 
tatttcggtg 
cccgctcagg 
gcttccccca 
ccagccctcc 
tgccagaaat 
tttatccctg 
gatggtgttg 
cctgggctcc 
cagccccctg 
cctggccgag 
cctcccccgg 
cctcaacgaa 
ggagcccacc 



tttttccctt 
tcgcgtagaa 
ggtttggggt 
ccaccctctc 
agctgagggt 
ccatttttct 
gttgcttggg 
cttgagggtt 
acctgttgca 
acctctctct 
tttaaaacaa 
cactctcgct 
ctcaccgcgg 
ttcgtgcact 
ggctgcgagc 
gggcagtcgt 
caggacgagg 
aagagctacc 
acctctgaga 



tttttgctcc 60 

cctgcgaatt 12 0 

ttttttgttt 180 

ggctgcagcc 2 40 

tagaaagcgg 3 00 

ctcccacccc 3 60 

ggtgaacaat 42 0 

tctccggcct 480 

aggctttaat 540 

acatttttgc 600 

aaacaaaaac 660 

ctcctgcccc 72 0 

tcctcctgct 780 

gcgagccctg" 840 

tggtcaagga. 900 

gcggcgtcta. 960 

agaagccgct 102 0 

gcgagcaagt 1080 

tggccgagga. 1140 



6 



WO 2005/086891 



PCT/US2005/007894 



gacctactcc cccaagatct tccggcccaa acacacccgc atctccgagc tgaaggctga 12 00 

agcagtgaag aaggaccgca gaaagaagct gacccagtcc aagtttgtcg ggggagccga 12 60 

gaacactgcc cacccccgga tcatctctgc acctgagatg agacaggagt ctgagcaggg 132 0 

cccctgccgc agacacatgg aggcttccct gcaggagctc aaagccagcc cacgcatggt 13 80 

gccccgtgct gtgtacctgc ccaattgtga ccgcaaagga ttctacaaga gaaagcagtg 1440 

caaaccttcc cgtggccgca agcgtggcat ctgctggtgc gtggacaagt acgggatgaa 1500 

gctgccaggc atggagtacg ttgacgggga ctttcagtgc cacaccttcg acagcagcaa 15 60 

cgttgagtga tgcgtccccc cccaaccttt ccctcacccc ctcccacccc cagccccgac 162 0 

tccagccagc gcctccctcc accccaggac gccactcatt tcatctcatt taagggaaaa 1680 
atatatatct atctatttga ggaaaaaaaa aaaaaaaaaa aa 1722 



<210> 12 
<211> 60 
<212> DNA 
<213> Homo 



sapiens 



<300> 

<308> AF055033 



<400> 12 

tccaccccag gacgccactc atttcatctc atttaaggga aaaatatata tctatctatt 60 



<210> 13 

<211> 1411 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> AK001166 



<400> 13 

aaacaaagag atgccacccc tgtgtgatgg ctttggtacc cgaacactga tggttcagac 60 

attttcccgt tgcatcttgt gttccaagga tgaagtggac ttggatgagt tattagctgc 12 0 

tagattggta acgtttctga tggacaatta ccaggaaatt ctgaaagtcc ctttggcctt 180 

gcagacctct atagaggagc gtg-tggctca tctacgaaga gtccagataa aatacccagg 240 

agctgatatg gatatcactt tatctgctcc atcattttgc cgtcaaatta gtccagagga 3 00 

atttgaatat caaagatcat atggctctca ggaacctctg gcagccttgt tggaggaagt 3 60 

cataacagat gccaaactct ccaacaaaga gaaaaagaag aaactgaagc agtttcagaa 42 0 

atcctatcct gaagtctatc aagaacgatt tcctacacca gaaagtgcag cacttctgtt 480 

tcctgaaaaa cccaaaccga aaccacagct gctaatgtgg gcactaaaga agcctttcca 540 

accatttcaa agaactagaa gttttcgaat gtaataatac ttccacagca acaggtgcta 600 

gagaccactg ttgttgtttt gagtgaatgg tggttaggag aaagactttg gtggtggaag- 660 

aaagaaaagc ataaaacaaa gactactgaa atatagataa agattgcctt agtttttaaa. 720 

aatgtttggc cattagtatt tttataaaac tcaatgctag ttttaagtgt ataaattggt 780 

taaaatttat gagtcaaata tafcagtgata atgttaacat gtttgtaatt gctacagaat 840 

ttaagggtat ttttatctct gt^ctttctt tttcatggtg tttattaaat aattgtgtat 900 

atacatccta gctactgata tctttattat agccttaaga cttaatttta agtcttaaaa 960 

atagcgtgta tacttgaata agaaagacac tgggtactgt tactgtgatg ctattgactt 102 0 

agtagccaat tatcatttct cctgtataaa ttccagtttt tattgctgca cataaatttt 1080 

ttaatgtctt atattgtgat agctatgtct tttattgcag atttattgga tgttatgaca. 1140 

gattttacta aagctagtgt ttttataaca tatatattag ttgatgttta cctataagtg- 1200 

gagtagattt tcatctgcct gcaatggtat aatttcagtc ttagctaaaa atggaaagtt 12 60 

gaactggata aattctttgg gtacccttag acctctgatt ctaagtcaaa tgcaaatggg- 132 0 

ttaaataaaa tgagactact tcctttataa atatattttc atccttttga aagtaagtga. 1380 
aatgtaaata aacttatttt ttttaaaaat g 1411 



<210> 14 
<211> 60 
<212> DNA 
<213> Homo 



sapiens 
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<300> 

<308> AK001166 



<400> 14 

acccttagac ctctgattct aagtcaaatg caaatgggtt aaataaaatg agactacttc 6 0 



<210> 15 

<211> 2352 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> AL049367 



<400> 15 

ggcaaacccc ttttaaaatc taatgtctgg gctttgagta ttagctcatt tagggt^gac 6 0 

aaatgcatta ctgttttcaa actgctcaca tttattcagt atttctccaa gttgctSLtct 120 

actcagcctt atgaatgccc ctcgcttttc taaggccatg tgaaaatcac ggcactgccc 180 

ttagccttgt gtcatctg-ct ttttcgttct gcgatatgcc cagttcccaa atcaattata 240 

ggtacctgtt taggagagag gaagatttta cctctcaaag ggtgagattt gaaatttaca 3 00 

ctaaaaagac aactttacat ttaatgcttc acttaatgag acattctttt ttttataagt 3 60 

ctatttttct actcagtttc agaacactaa tctgattttc actctgattt ttaacgtttc 42 0 

tttaaatatt tataatgtag cttctttcaa aatattttca tgaaaaatta cttttattat 480 

accattatgt gcatgttoLtt ggtagcaggc atagtttatt atttagtact gaaacatgct 540 

cttttaccta acagtaaaca agtatgtttt gatatatatc tgttaatatg cttata^tgg 6 00 

taagaaatgg acttgagcjtc ccaggagatt tcattttatt caccctggtc agatacaata 660 

aaggctatga gtataaatac ataacttcct aaccaggtgt agggcatgtt catgaatatc 72 0 

aaatcttttg atgctgga.cc caagagagga aaagttgtag ctaaatgttg atttacttat 7 80 

aactagacgt ctatgtgaga aaatatatgt atacatatat atgatatgca gaagtcactt 840 

tttttatcag gctttattct ccttacaaag ccacagttta actgtctgca acagtt^gtt 900 

tatgttaatg atagacaaat acccagtgtt tgttactttt tccaactacc actgtaatga 960 

taatctttct cacgtatata catgcaactt cttggcttca tttccatgaa gctgtt-tcaa 102 0 

tatattcagt atactttcjtc cttaatgctg cttctgttaa cagtgatctc tttctttttt 1080 

tcattcttat atcttcatta gttcatcata aatctgtcca gttgaggcct caggaccacg 1140 

gcatgatttc atgactccga agtattttac agaaacattt tttaaataag ggaaatattt 12 00 

tatataccag atggttcaca agtgatggct catagctagt tttttttttt tcttctaaaa 1260 

aatgtcaggt ttttaaaatc atttacctta ttaaaatgaa aagtgccata cttaactttt 132 0 

aaaggaaaga cctgacttgc tttttctcta tttagactgt ttttgtactt tactaatctt 1380 

taaactatca ggaaaaaaac caaaacttta taccaatgat ttagtaattt tgaggcatag 1440 

ggtagcttac gtagtggaigg atgtgccaaa tattctcttc aaatgccacc ttctcaattt 1500 

ataactaaaa tagtgttatc tgactaattc ctctgaattt tgatgtaaga tctatatagg 156 0 

cccccaaaat gatcgtacjta catgccagtc atttctcagt gaaataaata caataccaga 162 0 

gtacattatg ggttttattg ctttctttta tggtagacct gttaatgggg aaaaaataca 1680 

tcaaatcaaa tagaatctita tatctgtatg ttaaaataga gcacttacct gaagtcagtg 1740 

gcctggatca tagccctg-ga tcatttccca gtctgtcctg tgctgtgtga ccttggacaa 1800 

ggcgcttcat ctctctgcjgc ctctatttct ccatttgtaa aacaagtggc tgcagtagat 1860 

gatggctgag agcccttcct gttcccagat gccttggtcc aaagacccca cccctctgct 192 0 

ggtcctgcca acgtgttcjgt gctataagct gcttcagata taaaattggt ttatctataa 19 80 

tgtttgttca tttaatag-ct tctaaaaggc ctttttgtta tacagtgctt tttttctagt 2 040 

tttatggact tgattactgt aataatgtct tgtttttagc catgtaacta caaacagata 2100 

ttctcttgat gtcttagtaa atttgcattt gatatatcat tgatgagatt ttgttcjttat 216 0 

gtaatattct ttggctacgc atctgtccag catcttatta accataatac tgtgatcatt 2220 

atttggaaat atgtcctatg gaaagaataa aagcatgtac ttcacagcta gcatgttcac 22 80 

agatttgaaa gaagtttcat taaaagcacc attgctttct gtaaaaaaaa aaaaaaaaaa 2340 
aaaaaaaaaa aa 23 52 



<210> 16 
<211> 60 
<212> DNA 
<213> Homo 



sapiens 
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<300> 

<308> AL049367 
<400> 16 

atttggaaat atgtcctatg gaaagaataa aagcatgtac ttcacagcta gcatgttcac 60 

<210> 17 

<211> 1130 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AL080235 
<400> 17 

ggtcgccgca ccggccgcct ccggcccgcc gccgccccca gcgccgccgc cgccaccgcc 60 

ggggcgccca ccgcgctgcc agcctacccc gcggccgagc cgcccgggcc gctgtggctg- 12 0 

cagggcgagc cgctgcattt ctgctgccta gacttcagcc tggaggagct gcagggcgag- 180 

ccgggctggc ggctgaaccg taagcccatt gagtccacgc tggtggcctg cttcatgacc 240 

ctggtcatcg tggtgtggag cgtggccgcc ctcatctggc cggtgcccat catcgccggc 3 00 

ttcctgccca acggcatgga acagcgccgg accaccgcca gcaccaccgc agccaccccc 3 60 

gccgcagtgc ccgcagggac caccgcagcc gccgccgccg ccgccgctgc cgccgccgcc 42 0 

gcggccgtca cttcgggggt ggcgaccaag tgacccgctc cgctcctccc tgtgtccgtc 480 

ctgtgtccgc gcgcgcgggt gcctttcccg ccggggactc ggccggtgtg cttcgtgctg- 540 

tagttatcgt tagttcctct tcccgagatg gggccgccga gagaccccag cgcctttgaa 600 

aagcaaggtt tgtgctgcgc ttccagttcc gaaaagcaga tgtttaagcc cttggactgo. 660 

gggtgggatc gcagctccga agacggagag gagggaaatg gggccctttc ccctctattg 720 

catccccctg cccgactcct tccccgcacc cacgtgccct agattcatgg cagaaaatga 780 

ccaaatcctg tgtatttgtt ttatatattt aataactgtt ttaaatgaaa gttttagtaet 840 

aaaaaataca aaacaaaaag attaaattgc tattgctgta gtaagagaag ctctttgtat 900 

ctgaacatag ttgtatttga aatttgtggt tttttaattt atttaaaatt ggggggaggg- 96 0 

catgggaagg atttaacacc gatatattgt taccgctgaa aatgaacttt atgaaccttt 102 0 

tccaagttga tctatccagt gacgtggcct ggtgggcgtt tcttcttgta cttatgtggt 1080 

tttttggctt ttaatacaga cattttcctc caaaaaaaaa aaaaaaaagg 113 0 

<210> 18 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AL080235 
<400> 18 

ctttgaaaag caaggtttgt gctgcgcttc cagttccgaa aagcagatgt ttaagccctt 60 

<210> 19 

<211> 2498 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AL137540 
<400> 19 

gctgaaacga cagtcttgto cctgtcagag aaatgacctg aacgaagagc ctcaacattt 60 

tacacactat gcaatctatg- atttcattgt caagggcagc tgcttctgca atggccacgc 12 0 

tgatcaatgc atacctgtto atggcttcag acctgtcaag gccccaggaa cattccacat 180 

ggtccatggg aagtgtatgt gtaagcacaa cacagcaggc agccactgcc agcactgtgc 240 

cccgttatac aatgaccggo catgggaggc agctgatggc aaaacggggg ctcccaacga 3 00 
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gtgcagaacc tgcaagtgta. atgggcatgc tgatacctgt cacttcgacg ttaatgtcjtg 3 60 

ggaggcatca gggaatcgta gtggtggtgt ctgtgatgac tgtcagcaca acacagaagg 42 0 

acagtattgc cagaggtgca agccaggctt ctatcgtgac ctgcggagac ccttctcagc 480 

tccagatgct tgcaaaccgt gttcctgcca tccagtagga tcagctgtcc ttcctgccaa 540 

ctcagtgacc ttctgcgacc ccagcaatgg tgactgccct tgcaagcctg gggtggca_gg 6 00 

gcgacgttgt gacaggtgca tggtgggata ctggggcttc ggagactatg gctgtcgacc 66 0 

atgtgactgt gcggggagct gtgaccctat caccggagac tgcatcagca gccacacaga 72 0 

catagactgg tatcatgaag* ttcctgactt ccgtcccgtg cacaataaga gcgaaccagc 7 80 

ctgggagtgg gaggatgcgc aggggttttc tgcacttcta cactcaggta aatgcgaatg 840 

taaggaacag acattaggaa atgccaaggc attctgtgga atgaaatatt catatgtcjct 9 00 

aaaaataaag attttatcag* ctcatgataa aggtactcat gttgaggtca atgtgaacjat 9 60 

taaaaaggtc ttaaaatcta ccaaactgaa gattttccga ggaaagcgaa cattatafc.cc 102 0 

agaatcatgg acggacagag- gatgcacttg tccaatcctc aatcctggtt tggaatacct 1080 

tgtagcagga catgaggata. taagaacagg caaactaatt gtgaatatga aaagctttgt 1140 

ccagcactgg aaaccttctc ttggaagaaa agtcatggat attttaaaaa gagagtgoaa 12 00 

gtagcattaa gatggatagc acataatggc acttgtctat gtacaaaaca caaactttag 12 6 0 

agcaagaaga cctcagacag gaaactggaa ttttttaaag tgccaaaaca tatagaaatg 132 0 

tttgaatgca tgggtcttat ctaacttatc tcttctggac ccatgtttaa atacagtttt 13 80 

atttcatgaa gagaaatgaa aacccctaca ctgatatctg ttttctatgg gactgatfcct 1440 

gaaattctta actattaaga atattttaat agcagcatga catttagcag taatccatta 1500 

agggcagtac ctctaacaag gacgccttcc agcttcagcg atgttactta cgtttgatgc 156 0 

tacttaaagt aatgaatgac gttttaagga atccctaacc ctactatcag aaaaggtg-tt 162 0 

tgttaaagag ccttctcttcj tgtgttacgc atgaactttg gtctgtaggt gttaaatg-ga 16 80 

acctctccat gtgtatatacj tatttccttg tataaagcac tttactacct accacttgtg 1740 

ttgtgaacgt ttggtgactcj ctgttgaaag aaggaaaagg gtgtgtgaga aagcctactg 1800 

aagcagcagc actgccacta catgtggaca aaagtgacca tataaaagaa gttgtgctat 1860 

ttaactctga atacttggacj aaactaggtg aagatgcaac cagaaaggag aatatgtatg 192 0 

cgtgaagtct cagctttgacj ctggaggcta gattccaaga tgacagccat gatgaaactt 1980 

tttaaaaaac taaaccagaa gagactttaa aataagagaa agaaatcata aatgtagaca 2 040 

tatgcttggc taaaggggaa atggacttta aattttaaag agctcatttg caatgcactt 2100 

gtatacactt caaaaattat tgtagacaca gaatttgtta tatttttgtg cttagtattt 2160 

aaacctgaac attgaaacacj ttttcctcct tgtctttctt aacagtaata gtcattatat 2220 

ttacctgttt tttaacacaa tgtatgtgat agtcaaaaaa tcacagtttt teat tat tat 22 80 

tcatcttctg tacccacgca taaccactat acatagtttc ttttgtactt gaatatacaa 2340 

aacatgaaca cagtgecata tgaataattt cacatacaga accttttttt ctctgaac/tc 2400 

ctgtggactt gcaaatatat atatatattg ctttgttaat ttgtttttat atttcatata 2 46 0 
tgtaataaag gaatatgato tgaaaaaaaa aaaaaaaa 2 49 8 



<210> 20 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> AL137540 



<400> 20 

tggaggctag attccaagat gaeagecatg atgaaacttt ttaaaaaact aaaccagaag 60 

<210> 21 

<211> 914 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> AL160131 



<400> 21 

cgcaccgcag gagcaaeggt tggtcctgcg 

aagctgcccg gcctgaacac ggccaccatc 

cagcagctgg eggactcgat gctcaaagag 



gctgtgatgt cggtgttgag gcccctggac 6 0 
ttgctggtgg geaeggagga tgctcttctg 12 0 
gactgcgcct ccgagctgaa ggtccacttg 180 
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gcaaagtccc tccctttgcc ctccagtgtg aatcggcccc gaattgacct gatcgtgttt 240 

gtggttaatc ttcacagcaa atacagtctc cagaacacag aggagtccct gcgccatgtg- 3 00 

gatgccagct tcttcttggg aaaggtgtgt ttcctcgcca caggtgctgg gcgggagagc 3 60 

cactgcagca ttcaccggca oaccgtggtg aagctggccc acacctatca aagccccctg 42 0 

ctctactgtg acctggaggt g-gaaggcttt agggccacca tggcgcagcg cctggtgcgc 480 

gtgctgcaga tctgtgctgg ccacgtgccc ggtgtctcag ctctgaacct gctgtccctg- 540 

ctgagaagct ctgagggccc ctccctggag gacctgtgag ggtggctggc ccctgggct^ 6 00 

ccccttctca tggcttcgtg ctgactccat aaacattctc tgttgaggat gtccagtcag 660 

ggcttgacag gcccaggctc agcccgccgt ggctgggaag gttccctgca gtgccagtgc 72 0 

tgcagcaggg agagctgggc agaagcagcg agggggccca gctggcgaga ctgtagcccc 7 80 

ctcccactcc cacactcact cttgcagagc ctgtgtcttt aagcagctgg cgtgttacat: 840 

ctccatttaa ggtttccttt gaacaaaagg tctgtggcta aaaaaagttt aaaaatcact 9 00 
ggtctcattc acca 914 

<210> 22 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AL160131 
<400> 22 

agctggcgtg ttacatctcc atttaaggtt tcctttgaac aaaaggtctg tggctaaaaa 6 0 

<210> 23 
<211> 4753 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> D13642 
<400> 23 

cttcaatcaa gtagccttcc 
gagtaacaac cttattatca 
tcagagaaag cagcagatgg 
gctggccgca gagatggcag 
agctcccaag gctggcaatg 
agggaacaca ctggaccttg 
gtgcaggttt tccaacactg 
gatactaaac ccccgatctg 
tggggaaaaa ctggagtttt 
cccattccag gggagggtgt 
aaagaagaag ttactccgaa 
ccagactatt ggacataggg 
ctacaagcgt aatgaaaacc 
cactacagcc agcctcctgg 
atgtgtggtg aggctcccac 
caaagccctg tgggaccgtg 
gaactaccat gtcggggaga 
ctcagaatca cttgtctata 
gtcccatgag gaccatgact 
ccctctctgt gggcgggacc 
gattgatgga gacctctgtg 
ctctgaagaa ctggaccgaa 
ccgctacgcc ttctgagccc 
ttcccccacc accatcactg 
attaagactg cattatgaaa 
tggcttcccc tcaaattggc 
catgacccca ggttccagtg* 



cactgcagta cacacccagg aaatttgtca tccaccctga 6 0 

ttgaaacgga ccacaatgcc tacactgagg ccacgaaagc 12 0 

cagaggaaat ggtggaagca gcaggggagg atgagcggga 180 

cagcattcct caatgaaaac ctccctgaat ccatctttg-g 240 

ggcagtgggc ctctgtgatc cgagtgatga atcccattca 3 00 

tccagctgga acagaatgag gcagctttta gtgtggctgt 3 60 

gtgaagactg gtatgtgctg gtgggtgtgg ccaaggacct 420 

tggcaggggg cttcgtctat acttacaagc ttgtgaacaa 480 

tgcacaagac tcctgtggaa gaggtccctg ctgctattg-c 540 

tgattggtgt ggggaagctg ttgcgtgtct atgacctgg-g 600 

aatgtgagaa taagcatatt gccaattata tctctgggat 660 

taattgtatc tgatgtccaa gaaagtttca tctgggttog 72 0 

agcttatcat ctttgctgat gatacctacc cccgatggg-t 7 80 

actatgacac tgtggctggg gcagacaagt ttggcaacat 840 

ctaacaccaa tgatgaagta gatgaggatc ctacaggaaa 900 

gcttgctcaa tggggcctcc cagaaggcag aggtgatcaa.t 9 60 

cggtgctgtc cttgcagaag accacgctga tccctggag-g 102 0 

ccaccttgtc tggaggaatt ggcatccttg tgccattcac 1080 

tcttccagca tgtggaaatg cacctgcggt ctgaacatcc 1140 

acctcagctt tcgctcctac tacttccctg tgaagaatgt 12 00 

agcagttcaa ttccatggaa cccaacaaac aaaagaacgt 12 6 0 

ccccacccga agtgtccaag aaactcgagg atatccggac 13 2 0 

tcctttcccg gtggggcttg ccagagactg tgtgttttg-t 13 80 

ccacctggct tctgccatgt ggcaggaggg tgactggafca 1440 

gtcaacagct ctttcccctc agctcttctc ctggaatgac 1500 

actgagattt gctacacttc tccccacctg gtacatgata 1560 

tagaacctga gtcccccatt ccccaaagcc atccctgcat 162 0 
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tgatatgtct tgactctcct gtctactttt gcacacaccc ttaattttta attggttttc 1680 

ttgtaaatac agttttgtac aatgttatct ctgtgggagg aaggaggcag gctgtggtgg 1740 

gactgggtag ggtatagtat cactcctgag ttccactgct ctagaatcta accagaaata 180 0 

gaaacctagt ttttaaggtg actggcatcc atgtgtcttg ttctggagat gaggatgtag 1860 

gtgggaggtt tgaacccaag ttagagcagg aagaactgag tagactcctt ccttccagat 192 0 

accgacttgg acttgcggca ctctgtggct ccccaccccc aggtctgtgg tggtttcttt 1980 

gttttttcct ggttcttttt gctgtgctga tgaaacatga cctcaataac catgtgts.ta 2 04 0 

cccacccctc ttcccactgg gtattgagga agggtggctg attcttcctc ctcttctact 2100 

ctgaggatgt tagtatgggg attttagcat gaattccagc tggggagtct taacagatgc 216 0 

cccttttact gatagagcac ctaaagcgat ctttggctcc ataggaccat aggaaggg*tc 2220 

agtacagaag aacctagata ctgccctgcc cctgagaact gtgtatatgt ggggcctg-tc 22 80 

tgcagcaccc atctcaggtg ggttccagag ggcctttagg gtataatgag agcctgt-tag 2340 

gtggaagagg cccagttcca gaaatgttcc agcccacccc tgagaattcc tcctgtttag 2400 

ttgtgtggga agccctcgtc ttccaggctg tccttgcgcc ttgaacctgg agaagtgagc 246 0 

tcactgttct caatacttca caaatgtaaa actttctttc gtctgcatgt gctcagccat 2 52 0 

ctaaattgag caaatgatct ggtgagcact gggttagaat caggaatggt ggaatacaat 2580 

ctgaacctct cagagcccag aacagagggt tcctgacact gtgacactgt ctcctggaac 2 640 

taagtatctc ttgaatcatg acttggtttt agatcagtca agagagaccc aggttttgcc 27 0 0 

aggaatcgaa tccctaaata acatgttttt ttctcactta gctcatgaat ttgcatagta 2760 

gacagtagtt ctgaattaga ttttgaaaac ctaatttcag ggctcatttt ttcctgt^-gc 2 82 0 

cctaaatcca ttctatcaaa ttgtgtgata ctgacatgca gtcatctgag gaactcagcg 2880 

tagatacttg agcagctcct cgcctctttt ctaactcaag tttgactaaa atacatacac 2 940 

tccgtacaga aggtaggggg ttatgtaaga aaggaaaacc taatctatgg aatcaggagt 3 0 00 

tgtcaccacc gagcttcctc tggaagtctg cccatcagct tgcttgttct ctgttaagag 3060 

gaagggctag gacaaggatt tgggcttgaa tatgtggaaa ggaattttca tagttgt tgc 312 0 

tgcaggacct acaaaagttt aaaattagat tggatgtgac tcaatgacaa gtcccatctg 3180 

tgtaattgtt aaggggacct gattgactcc tgtggtttga ttgagcaacc aggtaaatag 3240 

agacctctct ccagctttgo; caaaacccat cagaggctgc tgcagaactc agacagaggg 33 00 

atctgccctt gggtttgctt ccatcctgtt ccattgctaa gcccttgtga cttggatcct 33 60 

aggactgaaa agtttttagc tgcctcagct ttcccctgac cttactggca gaggttctgc 3 42 0 

agatgtttcc tttggaagat ctcttgccaa gaatagcatt cctttggagg agggggg-ttc 3480 

tagttggaat gttgcttttc ttggttagtg taaatgtatt gctagtgaga cagctgccgg 3540 

cgctggaaaa ggctcgtctc acagggagag tgctggtccc cagaatgtgt gctgttccca 3 6 00 

cgctgctgcc tttcttgagc ttgttagagg aaagccagaa aggcattcag atgggatcag 3 660 

tctggctttc aaattttttt taattcctaa gttctgtttt attttttaat tttttaaaaa 372 0 

aaattttatt agagacagtc tctctctctt gcctagctgg gagtgcagtg gagtgatcat 37 80 

agctcactga ggcttgaact cctgggctcg agcaatccac ctcagcctcc agagtacjggg 3 840 

agactacaga tgtgtgccac catactcagc tagtttttaa actttcgtag agacagggtc 3 90 0 

tccctgtgtt gcccaggctg- gcctcgaact cctgacctca aaaaatcttc ctgccttggc 39 60 

ctcccagcgc tttgagaggc tgaggcagga ggatcccttg agcccaggag tttgagacca 402 0 

gcctgggcaa catgacaaaa ccccatctct ccaaaaatac aaaaattggc caggcafcggt 4080 

ggtgcacact tgtagtccca. gtaattaggg ggctgagaca ggaggatcac ttcagcctat 4140 

gagtttgagg ctgcagtgac/ ctgtgattgc gccactacac tccagcctgg atgacaggac 42 0 0 

gaaacctgtc tcaaaaacac caaaaaacaa aaaccggtct cctggggtca tggtagcaca 42 60 

aacgcacatg actgagtgct caggggttct gaggcttgtc cgctgacctg gggctcfcggc 432 0 

cctgggagat ctgggggacc tgctgtccta tatgtgatgc tttgaaagaa aggggcatca 43 80 

ttccaagcca agaggcccca gagagggcac cgtggggtgt tcaggcttct gtgaggcccc 444 0 

agtgagatcc tgtggctgtg* cccccatcac ctccacccac tctgccctcc cactagctgc 45 00 

ccaacggatg aatcaacgcc ttggcagagt tttccagcag ggccttgcag agagtgfcgtg 45 6 0 

tgacctgtgt ggccactgcc ttggggacgg gtgaggagtt agcctggaac attccagcgt 462 0 

gggcattatt gtcctgttgc aagttcaggg caaaaccagg aatccagttt tgtcgatcca 4680 

attgagaaaa catttcatga. acaactactt gtggcatgca ttggcactcg gaataaagcg 4740 
cactattgtc act 4753 

<210> 24 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<3 00> 

<308> D13642 
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<400> 24 

aaaccaggaa tccagttttg" tcgatccaat tgagaaaaca tttcatgaac aactacttgt 60 

<210> 25 

<211> 2591 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> D25328 



<400> 25 

cccggacgtg 

aagggctcct 

ctgaccagcg 

ggtatctacg 

ggaggctcaa 

gggacgatca 

gctgcttgca 

agcctcaccg 

aggaacggcc 

atggtgggct 

gccctgcaca 

aggaccttcg 

ttggcctgcg 

gagcagatgt 

attgtggctg 

gagcttgtcg 

agaggaggga 

gtcatcgcct 

aaccacgccg 

gcgatggacg 

aacctgaaca 

aattgcaacg 

cgctcagctg 
tttgacggct 
accggccaag 
gagatcgcca 
gaggcctacc 
gtccccatgg 
atcggggcag 
gccagcggaa 
ctggccaaca 
ttcgacatca 
atccagagag 
atttaccagc 
ggtcacatgc 
tctgccagag 
aaatttacca 
caacctgtgg 
tggtggctca 
gtgtcggact 
tgcatgtgcc 
gcactttatg 
cagctccagt 
gaattaatta 

<210> 26 
<211> 60 



cggctcccct 
tgcggaagtt 
gcggggatgo 
tgggggcca<a 
acatcgcaga 
ttggcagtgo 
acctgctgca 
gggccaacc t 
agatcgataa 
ccatcgacaa 
ggatcatcgei 
ttctggagg-t 
gtgcggactg- 
gtgtcaaact 
aaggagcaat 
tcacgcagct 
ccccttcggc 
tgctagagg-c 
tgcgcctgcc 
agaggagafc. t 
cctacaagcg 
tagctgtcoLt 

tgcgcgtgg-g 
tcgccaaggg 
gaggctcceLt 
cacagatgcg 
tgggactcct 
tcatggttcc 
acaccgccct 
ccaagcggcg 
tgggggggot 
gggatctgca 
gccttgtgct 
tgtattcag-a 
agcagggtQ-g 
ctatggagtg 
ccgatgattc 
cagagctgaa 
agctacggcc 
caggccagct 
tgcagccacc 
cacgtattat 
gcgtgctgtc 
a 2591 



cggcctcctc 

cctggagcac 

tcaaggtatg 

ggtgtacttc 

ggccgactgg 

gcggtgccag 

gcgcggcatc 

cttccggaag 

ggaggccgtg 

tgatttctgc 

ggtcgtcgac 

gatgggacga 

ggtgttcctt 

ctcggagaac 

tgatacccaa 

gggctatgac 

attcgacagg 

caccccggac 

gctgatggag 

tcaagatgcg 

acttgccatc 

caacgtgggg 

cattgccgac 
ccagatcaaa 
tcttgggaca 
cacgcacagc 
ggagctgtca 
cgctactgtg 
gaacactatc 
cgtgttcatc 
cgcggccgga 
gtccaacgtg 
cagaaatgag 
agagggcaaa 
ggcaccctct 
gatcactgca 
catttgtgtg 
gaagcaaacg 
cctcatgaaa 
ggaacatgtg 
gtggactgtc 
tgacattaat 
tgtggagtgt 



gccatggacg 
ctctccgggg 
aacgctgccg 
atctacgagg 
gagagtgtct 
gccttccgca 
accaacctgt 
gagtggagtg 
cagaagtacg 
ggcaccgaca 
gccatcatga 
cactgtgggt 
ccagaatctc 
cgtgcccgga 
aataaaccca 
acacgtgtga 
atcttggcca 
accccagctt 
tgcgtgcaga 
gttcgactcc 
aagctgccgg 
gcacccgcgg 

ggccacagga 
gaaatcggct 
aaacgcgttc 
atcaacgcgc 
gccgcccggg 
tccaacaatg 
accgacacct 
atcgagacca 
gctgatgccg 
gagcacctga 
agctgcagtg 
ggcgtgtttg 
ccatttgata 
aaactcaagg 
ctgggaataa 
gattttgagc 
atcctggcca 
cagccctgga 
tgtttttgta 
acctaatcgg 
gtctcatgct 



cggacgactc 

ccggcaaggc 

tccgtgccgt 

gctaccaggg 

ccagcatcct 

cgcgggaagg 

gtgtgatcgg 

ggctgctgga 

cctacctcaa 

tgaccatcgg 

ccacggccca 

acctggccct 

caccagagga 

aaaaaaggct 

tcacctctga 

ccatcctcgg 

gccgcatggg 

gcgtcgtgtc 

tgactcagga 

gagggaggag 

atgatcagat 

ctgggatgaa 

tgctcgccat 
ggacagatgt 
tcccggggaa 
tgctgatcat 
agaagcacga 
tgccgggttc 
gcgaccgcat 
tgggcggcta 
catacatttt 
cggagaaaat. 
aaaactacac 
actgcaggaa 
gaaactttgg 
aggcccgggg 
gcaaaagaaa 
acaggattcc 
agtacaaggc 
gtgtctgacc 
acacttaagt 
cgagtgccca 
ttcagatgtg 



ccgggccccc 60 

catcggcgtg 12 0 

ggtgcgoatg 180 

catggt^gac 240 

gcaagt^ggc 3 00 

ccgcctgaag 3 60 

cgggga cggg 42 0 

ggagct cjgcc 480 

cgtggt^ggc 540 

cacggactcc 600 

gagccaccag 66 0 

ggtgag-tgcc 72 0 

aggctg-ggag 7 80 

gaatattatt 840 

gaaaat caaa 900 

gcacgtgcag 9 60 

agtgga_g-gca 102 0 

actgaacggg 1080 

tgtgca-gaag 1140 

ctttgcgggc 12 00 

cccaaagacc 12 6 0 

cgcggccgta 132 0 

ctatgaLtggc 13 80 

cggggg-ctgg 1440 

gtacttggaa 15 00 

cggtggattc 156 0 

ggagttctgt 1620 

cgatttcagc 1680 

caagcagtcc 1740 

ctgtgg-ctac 1800 

cgaagagc c c 1860 

gaagaccacc 192 0 

caccgacttc 19 80 

gaacgtgctg 2 040 

aaccaaaatc 2100 

cagaggaaaa 2160 

cgttattttt 2220 

caaagaacag 22 80 

cagctatgac 23 40 

cagtcccgcc 2400 

tattttatca 2460 

tctgccccac 2 52 0 

catatgagca 25 80 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> D25328 
<400> 26 

tattttatca gcactttatg cacgtattat tgacattaat acctaatcgg cgagtgccca 6 0 

<210> 27 

<211> 2573 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> D50402 



<400> 27 

gaatcggccg atgtgaaccg aatgttgatg taagaggcag ggcactcggc tgcggatggg 60 

taacagggcg tgggctggca cacttacttg caccagtgcc cagagagggg gtgcag^ctg 12 0 

aggagctgcc cagagcaccg ctcacactcc cagagtacct gaagtcggca tttcaatgac 180 

aggtgacaag ggtccccaaa ggctaagcgg gtccagctat ggttccatct ccagccogac 2 40 

cagcccgacc agcccagggc cacggcaagc acctcccaga gagacctacc tgagtgagaa 3 00 

gatccccatc ccagacacaa aaccgggcac cttcagcctg cggaagctat gggccttcac 3 60 

ggggcctggc ttcctcatga gcattgcttt cctggaccca ggaaacatcg agtcagatct 42 0 

tcaggctggc gccgtggc^g gattcaaact tctctgggtg ctgctctggg ccaccgtgtt 480 

gggcttgctc tgccagcgac tggctgcacg tctgggcgtg gtgacaggca aggact-tggg 540 

cgaggtctgc catctctact accctaaggt gccccgcacc gtcctctggc tgaccatcga 600 

gctagccatt gtgggctccg acatgcagga agtcatcggc acggccattg cattcaatct 66 0 

gctctcagct ggacgaatcc cactctgggg tggcgtcctc atcaccatcg tggacacctt 72 0 

cttcttcctc ttcctcgata actacgggct gcggaagctg gaagcttttt ttggactcct 7 80 

tataaccatt atggccttgra cctttggcta tgagtatgtg gtggcgcgtc ctgagcaggg 840 

agcgcttctt cggggcct^t tcctgccctc gtgcccgggc tgcggccacc ccgagctgct 9 00 

gcaggcggtg ggcattgttg gcgccatcat catgccccac aacatctacc tgcactcggc 9 60 

cctggtcaag tctcgagaga tagaccgggc ccgccgagcg gacatcagag aagccaacat 102 0 

gtacttcctg attgaggcca ccatcgccct gtccgtctcc tttatcatca acctctttgt 1080 

catggctgtc tttgggcagg ccttctacca gaaaaccaac caggctgcgt tcaacatctg 1140 

tgccaacagc agcctccacg actacgccaa gatcttcccc atgaacaacg ccaccgtggc 12 00 

cgtggacatt taccaggg-^g gcgtgatcct gggctgcctg ttcggccccg cggccctcta 12 60 

catctgggcc ataggtctcc tggcggctgg gcagagctcc accatgacgg gcacctacgc 132 0 

gggacagttc gtgatgga_gg gcttcctgag gctgcggtgg tcacgcttcg cccgtgtcct 13 80 

cctcacccgc tcctgcgcca tcctgcccac cgtgctcgtg gctgtcttcc gggacotgag 1440 

ggacttgtcg ggcctcaatg atctgctcaa cgtgctgcag agcctgctgc tcccgttcgc 1500 

cgtgctgccc atcctcacgt tcaccagcat gcccaccctc atgcaggagt ttgcca_atgg 1560 

cctgctgaac aaggtcgtca cctcttccat catggtgcta gtctgcgcca tcaacctcta 162 0 

cttcgtggtc agctatctgc ccagcctgcc ccaccctgcc tacttcggcc ttgcag-cctt 1680 

gctggccgca gcctacctgg gcctcagcac ctacctggtc tggacctgtt gccttgccca 1740 

cggagccacc tttctggccc acagctccca ccaccacttc ctgtatgggc tccttgaaga 1800 

ggaccagaaa ggggagacct ctggctaggc ccacaccagg gcctggctgg gagtggcatg 1860 

tatgacgtga ctggcctgct ggatgtggag ggggcgcgtg caggcagcag gatgga.gtgg 192 0 

gacagttcct gagaccagcc aacctggggg ctttagggac ctgctgtttc ctagco/cagc 19 80 

catgtgatta ccctctggrgt ctcagtgtcc tcatctgtaa aatggagacg ccaccaccct 2 040 

tgccatggag gttaagcact ttaacacagt gtctggcact tgggacaaaa acaaacaaac 2100 

aaacaaaaaa catttcaaaa ggtatttatt gagcacctgc aggcgtgacc tgacagccca 2160 

agggtgggtg gggtgagg-gc ttgaggactt gggcgggaca caggctccaa actgga.gctt 222 0 

gaaatagtgt ctgatgaatg ttaaattatc tatctatcta tttatttatt tatfctg-agac 22 80 

agggaaaggg tctccctctg ttgccaaggc tggagtgcag tggcgcaatc ttaactcatt 2340 

gcaacctcca ccttctgg-gt tcaagcgatt ctctttattc agccccggga gtggc^cgcg 2400 

ccaccacgcc cagctaattt gtgtattttc agcagagacg gggtttgcca tgctggccag 2460 

gctggtctcg aactgctgga ttcaagtgat ccgcccatct ccgtctccca aagtgctggg 252 0 
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aattacaggc gtgagccacc aaaacccggc ctgattaaag ttaaataaat acg 2 573 

<210> 28 
<211> 60 
<212> DNA 

<213> Homo sapien_s 
<300> 

<308> D50402 
<400> 28 

tggaggttaa gcactttaac acagtgtctg gcacttggga caaaaacaaa caaacaaaca 60 

<210> 29 
<211> 3672 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> L27560 
<400> 29 

acatgtgcat atttcattcc ccaggcagac attttttaga aatcaataca tgccccaata 6 0 

ttggaaagac ttgttcttcc acggtgacta cagtacatgc tgaagcgtgc cgtttcagcc 12 0 

ctcatttaat tcaatttgta agtagcgcac gagcctctgt gggggaggat aggctgaaaa 180 

aaaaaagtgg gctcgtattt atctacagga ctccatatag tcatatatag gcatataaat 240 

ctatgctttt tctttgtttt tttctttctt cctttctttc aaaggtttgc attaactttt 300 

caaagtagtt cctatagggg cattgaggag cttcctcatt ctgggaaaac tgagaaaacc 3 60 

catattctcc taatacaacc cgtaatagca tttttgcctg cctcgaggca gagtttcccg 42 0 

tgagcaataa actcagcttt tttgtggggc acagtactgg atttgacagt gattccccac 480 

gtgtgttcat ctgcacccac cgagccaggc agaggccagc cctccgtggt gcacacagca 540 

cgcgcctcag tccatcccat tttagtcttt aaaccctcag gaagtcacag tctccggaca 600 

ccacaccaca ttgagcccaa caggtccacg atggatccac ctagtcccac cccagccttt 6 60 

ttctttcatc tgaacagaat gtgcattttt ggaagcctcc ctcactctcc atgctggcag 72 0 

agcaggaggg agactgaagt aagagatggc agagggagat ggtggcaaaa aggtttagat 780 

gcaggagaac agtaag-atgg atggttccgg ccagagtcga tgtggggagg aacagagggc 840 

tgaagggaga gggggctgac tgttccattc tagctttggc acaaagcagc agaaaggggg 900 

aaaagccaat agaaatttcc ttagcttccc caccatatgt attttcatgg atttgagagg 960 

aaagagagga aaatg^ggga atgggttgca aaatagaaat gagcttaatc caggccgcag 102 0 

agccagggaa ggtgagtaac cttaggaggg tgctagactt tagaagccag ataggaagaa 1080 

tcagtctaaa ctggccatgc tttggaaggg acaagactat gtgctccgct gcccaccttc 1140 

agcctgcaat gagggactga ggcccacgag tctttccagc tcttcctcca ttctggccag 12 0 0 

tccctgcatc ctccctgggg tggaggatgg aaggaaagct gggacaagca gggaacgcat 12 60 

gattcaggga tgctg-tcact cggcagccag attccgaaac tcccattctc caatgacttc 132 0 

ctcaaccaat gggtgg-cctt gtgactgttc tttaaggctg aagatatcca ggaaa-aggggg 13 80 

cttggacact ggccaaggag accccttcgt gctgtggaca cagctctctt cactctttgc 1440 

tcatggcatg acacagcgga gaccgcctcc aacaacgaat ttggggctac gaagaggaat 15 00 

agcgaaaaag caaatctgtt tcaactgatg ggaaccctat agctatagaa cttg-ggggct 156 0 

atctcctatg cccctggaca ggacagttgg ctggggacag gagaagtgct caafccttcat 162 0 

gagacaaagg ggcccgatca aggcagccac aaggccttga cctgccgagt cagcatgccc 1680 

catctctctc gacagctgtc ccctaaaccc aactcacgtt tctgtatgtc ttag-gccagt 1740 

atcccaaacc tcttccacgt cactgttctt tccacccatt ctccctttgc atcttgagca 1800 

gttatccaac taggatctgc caagtggata ctggggtgcc actcccctga gaaaagactg 1860 

agccaggaac tacaagctcc ccccacattc ctcccagcct ggacctaatt cttgagaggg 192 0 

gctctctctt cacggactgt gtctggactt tgagcaggct tctgcccctt gcgfctggctc 19 80 

tttgctgcca gccatcaggt gggggattag agcctggtgt aagtgcgcca gactcttccg 2 040 

gtttccaaag ttcgfc^cctg cgaacccaaa cctgtgagtc tcttctgcat gcag-gagttt 2100 

ctcctgggca gctgg-tcact ccccagagaa gctgggcctt catggacaca tggaactaag 2160 

cctcccaaat gggag-ttctg gctgagccca gggtggggag atcctgggaa gggsiggcact 222 0 

ggaggaagac ggcacctctt cccccatggc agggtgtgag ggaggcaggt tfcggaatggt 22 80 

gcgagtatgg caatctaagc aggggtctgg tctctttgac tccaggctcg ctttggccga 23 40 
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ctgtctgctc acccagagac cttggactcc ggactatcca tggctccgaa tctaagtgct 2400 

gcccactccc atgctcacac ccacagaagg tcttcccatc ccctttagat tcgtgcctca 2 46 0 

ctccaccagt gaggaag-atg cctctgtctt tcccacgact gccaggagat agggaagccc 2 52 0 

agccaggact gaccctcctt cctccagcct gccctgaccc acctggcaaa gcagggcaca 25 80 

tggggaggaa gagactg-gaa cctttctttg acagccaggc ctagacagac a.ggcctgggg 2 640 

acactggccc atgaggggag gaaggcaggc gcacgaggtc cagggaggcc cttttctgat 27 00 

catgcccctt ctctcccacc ccatctcccc accaccacct ctgtggcctc catggtaccc 2 76 0 

ccacagggct ggcctcccct agagggtggg cctcaaccac ctcgtcccgc cacgcaccgg 2 82 0 

ttagtgagac agggctgcca cgcaaccgcc aagcccccct caaggtggga cagtaccccg 2 880 

gacccatcca ctcactcctg agaggctccg gcccagaatg ggaacctcag a.gaagagctc 2 940 

taaggagaag aaaccccata gcgtcagaga ggatatgtct ggcttccaag agaaaggagg 3 00 0 

ctccgttttg caaagtggag gagggacgag ggacaggggt ttcaccagcc agcaacctgg 3 06 0 

gccttgtact gtctgtgttt ttaaaaccac taaagtgcaa gaattacatt g-cactgtttc 312 0 

tccacttttt attttctctt aggcttttgt ttctatttca aacatacttt ottggttttc 3180 

taatggagta tatagtttag tcatttcaca gactctggcc tcctctcctg aaatcctttt 3240 

ggatggggaa agggaa^gtg gggagggtcc gaggggaagg ggaccccagc ttccctgtgc 33 00 

ccgctcaccc cactccacca gtccccggtc gccagccgga gtctcctctc taccgccact 33 60 

gtcacaccgt agcccacatg gatagcacag ttgtcagaca agattccttc agattccgag 342 0 

ttgctaccgg ttgttttcgt tgttgttgtt gttgtttttc tttttctttt fcttttttgaa 3480 

gacagcaata accacagtac atattactgt agttctctat agttttacat acattcatac 3540 

cataactctg ttctctcctc ttttttgttt tcaactttaa aaacaaaaat aaacgatgat 3 600 

aatctttact ggtgaaaagg atggaaaaat aaatcaacaa atgcaaccag tttgtgagaa 3 660 
aaaaaaaaaa aa 36 72 

<210> 30 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> L27560 
<400> 30 

agcaacctgg gccttgtact gtctgtgttt ttaaaaccac taaagtgcaa gaattacatt 6 0 

<210> 31 

<211> 1416 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> Modif ied_base 

<222> 1 . . . 1416 

<223> n = a,c,g, or t 

<30.0> 

<308> M55914 
<400> 31 

aggaattccg gaattccgga attccgatgg atggaacaga aaataaatct 
cgaacgccat tctggg-^gtg tcccttgccg tctgcaaagc tggtgccgtt 
tcccctgtac cgccacatcg cgtacttggc tggcaacttc gaagtcatcc 
ggcgttcaag tgtcatcatc aatggcggtt ctcatgctgg caacaagctg 
gtctgtcctc ccagtccjgtg cagcaaactc agggaagcca tgccgcattg 
ttaccacaac ctgaagaatg tcatcaagga gaaatatggg aaagatgcca 
gatttgcgcg ggtttg~ctcc caacatcctg gagaataaag aaggcctgga 
actgctattg gaaagcctgg cctacactgt aaaggtggtc atggcatgga 
tccgagttct tcaggtcagg gaactatgac ctggacttca agtctcccga 
aggtacatct cgcctgacca gctggctgac ctgtacaagt ccttcatcaa 
gtggtgtcta tcgaagjatcc ctttgaccag gatgactggg gagcttcaga 
cagtgcagga atccag-cjtag tggggggatg actcacagtg accaacccaa 



sagtttggtg 60 

g-agaaggggg 12 0 

tgccagtccc 180 

gccatgcaga 240 

gagcagaggt 3 00 

ccaatgtggg 3 60 

gctgctgaag 42 0 

cgtagcggcc 480 

tgaccccagc 540 

ggactaccca 600 

agttcacagc 660 

agaggatcgc 720 
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caaggcgtga 
gtgaccgagt 
tctcatcgtt 
actggggcag 
ctcctcagaa 
aaccccttgg 
ccctcccctg 
tgctagttag 
aagctccctg 
attgtttttc 
tccggggtgg 
gacccatgaa 



acgagaagtc 
ctcttcaggc 
cgggg^agac 
atcaacjactg 
ttgaag-agga 
ccaagtaagc 
gtgtcsigctc 
cgcccaccgc 
gaagccctgt 
tcgccttact 
ccacaggcta 
aaaaaaaaag 



ctgcaactgc 
gtgcaagctg 
tgaagatacc 
gtgccccttg 
gctgggcagc 
tgtgggcagg 
aggcagctcg 
cgtggagttc 
tggcagctct 
ttccaccaag 
gatccccggt 
gaattccgga 



ctcctgctca aagtcaacca 
gcccaggcca atggttgggg 
ttcatcgctg acctggttgt 
ccgatcacgc gcttggccaa 
aaggctaagt ttgccggcag 
caagccttcg gtcacctgtt 
aggcccccga ccaacacttg 
gtaccgcttc cttagaactc 
agctttgcag ttgtgtaatt 
tgtctagagt catgtgagcc 
ggttttgtgc tcaaaataaa 
attccg 1416 



gattggctcc 7 80 

cgtcatggtg 840 

ggggctgtgc 900 

gtacaaccag 960 

gaacttcaga 102 0 

ggctacagac 1080 

caggggtccc 1140 

tacagaagcc 12 0 0 

ggcccaagtc 12 6 0 

tngtgtcatc 132 0 

aagcctcagt 13 80 



<210> 32 
<211> 60 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> M55914 



<400> 32 

gtaccgcttc cttagaactc tacagaagcc aagctccctg gaagccctgt tggcagctct 60 

<210> 33 
<211> 2517 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> M96577 
<400> 33 

ggaattccgt ggccgcjgact ttgcaggcag cggcggccgg gggcggagcg ggafccgagcc 60 

ctcgccgagg cctgccgcca tgggcccgcg ccgccgccgc cgcctgtcac ccgcjgccgcg 12 0 

cgggccgtga gcgtcatggc cttggccggg gcccctgcgg gcggcccatg cgccjccggcg 180 

ctggaggccc tgctccjgggc cggcgcgctg cggctgctcg actcctcgca gatcgtcatc 240 

atctccgccg cgcag-cjacgc cagcgccccg ccggctccca ccggccccgc ggcgcccgcc 3 00 

gccggcccct gcgaccctga cctgctgctc ttcgccacac cgcaggcgcc ccggcccaca 360 

cccagtgcgc cgcggcccgc gctcggccgc ccgccggtga agcggaggct ggacctggaa 42 0 

actgaccatc agtacctggc cgagagcagt gggccagctc ggggcagagg ccgocatcca 480 

ggaaaaggtg tgaaatcccc gggggagaag tcacgctatg agacctcact gaatctgacc 540 

accaagcgct tcctcjgagct gctgagccac tcggctgacg gtgtcgtcga cctcjaactgg 600 

gctgccgagg tgctgraaggt gcagaagcgg cgcatctatg acatcaccaa cgtccttgag 660 

ggcatccagc tcattgccaa gaagtccaag aaccacatcc agtggctggg cagccacacc 72 0 

acagtgggcg tcggcggacg gcttgagggg ttgacccagg acctccgaca gctcjcaggag 7 80 

agcgagcagc agctcj-gacca cctgatgaat atctgtacta cgcagctgcg cctgctctcc 840 

gaggacactg acagccagcg cctggcctac gtgacgtgtc aggaccttcg tagcattgca 900 

gaccctgcag agcacjatggt tatggtgatc aaagcccctc ctgagaccca gctccaagcc 960 

gtggactctt cggacjaactt tcagatctcc cttaagagca aacaaggccc gatcgatgtt 102 0 

ttcctgtgcc ctgacjgagac cgtaggtggg atcagccctg ggaagacccc atcccaggag 1080 

gtcacttctg aggacjgagaa cagggccact gactctgcca ccatagtgtc accaccacca 1140 

tcatctcccc cctcatccct caccacagat cccagccagt ctctactcag cctggagcaa 1200 

gaaccgctgt tgtcccggat gggcagcctg cgggctcccg tggacgagga ccgcctgtcc 12 6 0 

ccgctggtgg cggccgactc gctcctggag catgtgcggg aggacttctc cggcctcctc 132 0 

cctgaggagt tcatcagcct ttccccaccc cacgaggccc tcgactacca cttcggcctc 13 80 

gaggagggcg agggcatcag agacctcttc gactgtgact ttggggacct cacccccctg 1440 

gatttctgac agggcttgga gggaccaggg tttccagagt agctcacctt gtctctgcag 1500 

ccctggagcc ccctgtccct ggccgtcctc ccagcctgtt tggaaacatt taatttatac 1560 

ccctctcctc tgtctccaga agcttctagc tctggggtct ggctaccgct aggsiggctga 162 0 
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gcaagccagg aagggaagga gtctgtgtgg tgtgtatgtg catgcagcct acacccacac 1680 

gtgtgtaccg gg^gtgaatg tgtgtgagca tgtgtgtgtg catgtaccgg ggaatgaagg 1740 

tgaacataca ccfcctgtgtg tgcactgcag acacgcccca gtgtgtccac atgrtgtgtgc 1800 

atgagtccat ctctgcgcgt gggggggctc taactgcact ttcggccctt ttg-ctcgtgg 1860 

ggtcccacaa ggcccagggc agtgcctgct cccagaatct ggtgctctga ccaggccagg 192 0 

tggggaggct tt^gctggct gggcgtgtag gacggtgaga gcacttctgt cttaaaggtt 19 80 

ttttctgatt gaagctttaa tggagcgtta tttatttatc gaggcctctt tgg-tgagcct 2 040 

ggggaatcag caaaagggga ggaggggtgt ggggttgata ccccaactcc ctctaccctt 2100 

gagcaagggc ag-g-ggtccct gagctgttct tctgccccat actgaaggaa ctg*aggcctg 216 0 

ggtgatttat ttattgggaa agtgagggag ggagacagac tgactgacag ccatgggtgg 222 0 

tcagatggtg gg^tgggccc tctccagggg gccagttcag ggcccagctg ccccccagga 22 80 

tggatatgag atgggagagg tgagtggggg accttcactg atgtgggcag gag-gggtggt 23 40 

gaaggcctcc cccagcccag accctgtggt ccctcctgca gtgtctgaag cgcctgcctc 2400 

cccactgctc tgccccaccc tccaatctgc actttgattt gcttcctaac agctctgttc 2460 
cctcctgctt tg^ttttaat aaatattttg atgacgttaa aaaaaggaat tcgatat 2 517 

<210> 34 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> M96577 
<400> 34 

gtaggacggt gaLgagcactt ctgtcttaaa ggttttttct gattgaagct ttaatggagc 

<210> 35 
<211> 4437 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_000O 57 
<400> 35 

gcgcggcggc cg-tggttgcg gcgcgggaag tttggatcct ggttccgtcc gctaggagtc 
tgcgtgcgag ga_ttatggct gctgttcctc aaaataatct acaggagcaa ctagaacgtc 
actcagccag aacacttaat aataaattaa gtctttcaaa accaaaattt tcaggtttca 
cttttaaaaa gaaaacatct tcagataaca atgtatctgt aactaatgtg tcagtagcaa 
aaacacctgt attaagaaat aaagatgtta atgttaccga agacttttcc ttcagtgaac 
ctctacccaa caccacaaat cagcaaaggg tcaaggactt ctttaaaaat gctccagcag 
gacaggaaac acagagaggt ggatcaaaat cattattgcc agatttcttg cagactccga 
aggaagttgt atgcactacc caaaacacac caactgtaaa gaaatcccgg gatactgctc 
tcaagaaatt agaatttagt tcttcaccag attctttaag taccatcaat gattgggatg 
atatggatga ctttgatact tctgagactt caaaatcatt tgttacacca ccccaaagtc 
actttgtaag agtaagcact gctcagaaat caaaaaaggg taagagaaac ttttttaaag 
cacagcttta tacaacaaac acagtaaaga ctgatttgcc tccaccctcc tctgaaagcg 
agcaaataga tttgactgag gaacagaagg atgactcaga atggttaagc agcgatgtga 
tttgcatcga tgatggcccc attgctgaag tgcatataaa tgaagatgct caggaaagtg 
actctctgaa aactcatttg gaagatgaaa gagataatag cgaaaagaag aagaatttgg 
aagaagctga attacattca actgagaaag ttccatgtat tgaatttgat gatgatgatt 
atgatacgga ttttgttcca ccttctccag aagaaattat ttctgcttct tcttcctctt 
caaaatgcct tagtacgtta aaggaccttg acacatctga cagaaaagag gatgttctta 
gcacatcaaa agatcttttg tcaaaacctg agaaaatgag tatgcaggag ctgaatccag 
aaaccagcac agactgtgac gctagacaga taagtttaca gcagcagctt atlicatgtga 
tggagcacat ctgtaaatta attgatacta ttcctgatga taaactgaaa cttttggatt 
gtgggaacga actgcttcag cagcggaaca taagaaggaa acttctaacg gaagtagatt 
ttaataaaag tgatgccagt cttcttggct cattgtggag atacaggcct ga-ttcacttg 
atggccctat gg-agggtgat tcctgcccta cagggaattc tatgaaggag tteaaattttt 
cacaccttcc ctcaaattct gtttctcctg gggactgttt actgactacc accctaggaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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agacaggatt ctctgccacc aggaagaatc 
tacagaagtc ctttgtaagt agcaactggg 
aaagctctta tttcccagga aatgttctca 
atactgcttc aataaatgac ttagaaagag 
ttgacataga tgactttgat gatgatgatg 
ccagcaaatc ttccacagct gcctatcaac 
tatcagaaag actttcctca gccaagacag 
atataaactt ctcagagtca attcagaatt 
ccagaaatct gaaacatgag cgtttccaaa 
tgaagatttt tcataaaaaa tttggcctgc 
tcaatgctgc actgcttggt gaagactgtt 
gtttgtgtta ccagctccct gcctgtgttt 
tgagatcact tatcgtagat caagtccaaa 
atctgacagg tgataagact gactcagaag 
aagacccaat cataaaactt ctatatgtca 
tcatttctac tctgg-a.gaat ctctatgaga 
aagcacattg tgtcagtcag tggggacatg 
tgcttcgcca gaagtttcct tctgttccgg 
gggtacagaa ggacatcctg actcagctga 
gctttaacag acataatctg aaatactatg 
ttgattgcct agaatggatc agaaagcacc 
tctccaggcg agaatgtgac accatggctg 
ttgcttacca tgctgrcjcctc agtgattctg 
atcaggatgg ctgtcaggtt atctgtgcta 
cggacgtgcg atttg-tgatt catgcatctc 
aatctggcag agctggaaga gatggggaaa 
atgatgtgac cagactgaaa agacttataa 
gagaaactca cttcaataat ttgtatagca 
gcaggagaat acagcttttg gcctactttg 
agaaacaccc agatg-tttct tgtgataatt 
atgtgactga cgatgtgaaa agtattgtaa 
gaatgagaaa tataaaacat gtaggtcctt 
acattttctt gggga_gtaag agtgcaaaaa 
cttattcacg acacaatgcc gaaagacttt 
atgaagactt atatatcaat gccaatgacc 
aagcccaaac tgtac taaat ggcaatttaa 
gcagtgtgaa aaaacaaaaa gcgttagtag 
aaaaatgtct tggagaactt acagaagtct 
attacttcaa tatttttaat accgtcactc 
atcctgaggt tttgcttcaa attgatggtg 
cggaagtgat ttcagtatta cagaaatact 
ccccagggat aagcctgtcc agcagcagag 
acgaggaaat acccgtatct tcccactact 
ggaaaaagat gccagcctcc caaaggtcta 
aggcaaaggg ggggtctgcc acatgtagaa 
tcattggatc cagttcagcc tcacatactt 
tggggattat ggctccaccg aagcctataa 
tctcataaca accgaatctc aatgtacata 
catctgtgac tataaagctg ttattcttgt 



tttttgaaag gcctttattc aatacccatt 1560 

ctgaaacacc aagactagga aaaaaaaatg 162 0 

caagcactgc tgtgaaagat cagaataaac 1680 

aaacccaacc ttcctatgat attgataatt 1740 

actgggaaga cataatgcat aatttagcag 1800 

ccatcaagga aggtcggcca attaaatcag 1860 

actgtcttcc agtgtcatct actgctcaaa 192 0 

atactgacaa gtcagcacaa aatttagcat 19 80 

gtcttagttt tcctcataca aaggaaatga 2 040 

ataattttag aactaatcag ctagaggcga 2100 

ttatcctgat gccgactgga. ggtggtaaga 2160 

ctcctggggt cactgttgtc atttctccct 2220 

agctgacttc cttggatatt ccagctacat 2280 

ctacaaatat ttacctccag ttatcaaaaa 2 340 

ctccagaaaa gatctgtgca. agtaacagac 2400 

ggaagctctt ggcacgtttt gttattgatg 2460 

attttcgtca agattacaaa agaatgaata 2 52 0 

tgatggctct tacggccaca gctaatccca 2580 

agattctcag acctcaggtg- tttagcatga 2 640 

tattaccgaa aaagcctaaa aaggtggcat 27 0 0 

acccatatga ttcagggata. atttactgcc 2760 

acacgttaca gagagatggg- ctcgctgctc 2 82 0 

ccagagatga agtgcagcag - aagtggatta 2 880 

caattgcatt tggaatgggg- attgacaaac 2 940 

tccctaaatc tgtggagggt tactaccaag 3 000 

tatctcactg cctgcttttc tatacctatc 3 060 

tgatggaaaa agatggaaac catcatacaa 312 0 

tggtacatta ctgtgaaaat ataacggaat 3180 

gtgaaaatgg atttaatcct gatttttgta 3240 

gctgtaaaac aaaggattat aaaacaagag 33 00 

gatttgttca agaacatagt tcatcacaag 33 60 

ctggaagatt tactatgaat atgctggtcg 3 42 0 

tccagtcagg tatatttgga. aaaggatctg 3480 

ttaaaaagct gatacttgac aagattttgg 3540 

aggcgatcgc ttatgtgatg- ctcggaaata 3 600 

aggtagactt tatggaaaca gaaaattcca 3 660 

caaaagtgtc tcagagggaa. gagatggtta 3 72 0 

gcaaatctct ggggaaagtt tttggtgtcc 3 7 80 

tcaagaagct tgcagaatct ttatcttctg 3 840 

ttactgaaga caaactggaa aaatatggtg 3 9 00 

ctgaatggac atcgccagct gaagacagtt 39 6 0 

gccccggaag aagtgccgct gaggagcttg 402 0 

ttgcaagtaa aaccagaaat gaaaggaaga 4080 

agaggagaaa aactgcttcc agtggttcca 4140 

agatatcttc caaaacgaaa tcctccagca 42 00 

ctcaagcgac atcaggagcc aatagcaaat 42 60 

atagaccgtt tcttaagcct tcatatgcat 432 0 

gaccctcttt cttgtttgto agcatctgac 43 80 
tataccaaaa aaaaaaaaaa aaaaaaa 4437 



<210> 36 
<211> 60 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_000057 



<400> 36 

taagccttca tatgcattct cataacaacc gaatctcaat gtacatagac cctctttctt 60 
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<210> 37 
<211> 2016 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_00006O 
<400> 37 

gccagctgga gcgttttcgg ggctgtaaag 
gaaggcgcgc taagagcaga tttgtggtct 
ctcttttcct ctgcggctgt tacgtggttg 
tggctgacca tcacgaggct gaatattatg 
tgagtctgaa ccctctggct ctcatcagcc 



ggagaatggc gcatgcgcat attcagggcg 60 

gcattatgtc tggagccaga ag-taagcttg 12 0 

ccctgggagc ccacaccggg gaiggagagcg 180 

tggctgccgt gtatgagcat ccatccatcc 2 40 

gccaagaggc cttggagctc atgaaccaga 3 00 



accttgacat ctatgaacag caagtgatga ctgcagccca aaaggatgta cagattatag 3 60 

tgtttccaga agatggcatt catggattca actttacaag aacatccatt tatccatttt 42 0 

tggacttcat gccgtctccc caggtggtca ggtggaaccc atgcctggag cctcaccgct 480 

tcaatgacac agaggtgctc cagcgcctga gttgtatggc catcagggga gsitatgttct 540 

tggtggccaa tcttgggaca aaggagcctt gtcatagcag tgacccaagg tgcccaaaag 600 

atgggagata ccagttcaac acaaatgtcg tgttcagcaa taatggaacc cttgttgacc 660 

gctaccgtaa acacaacctc tactttgagg cagcattcga tgttcctctt aaagtggatc 72 0 

tcatcacctt tgataccccc tttgctggca ggtttggcat cttcacatgc tttgatatat 7 80 

tgttctttga ccctgccatc agagtcctca gagactacaa ggtgaagcat gttgtgtacc 840 

caactgcctg gatgaaccag ctcccactct tggcagcaat tgagattcag aaagcttttg 9 00 

ctgttgcctt tggcatcaac gttctggcag ctaatgtcca ccacccagtt ctggggatga 96 0 

caggaagtgg catacacacc cctctggagt ccttttggta ccatgacatg gaaaatccca 102 0 

aaagtcacct tataattgcc caggtggcca aaaatccagt gggtctcatt ggtgcagaga 1080 

atgcaacagg tgaaacggac ccatcccata gtaagttttt aaaaattttg tcaggcgatc 114 0 

cgtactgtga gaaggatgct caggaagtcc actgtgatga ggccaccaag tg-gaacgtga 12 00 

atgctcctcc cacatttcac tctgagatga tgtatgacaa tttcaccctg gtccctgtct 12 6 0 

ggggaaagga aggctatctc cacgtctgtt ccaatggcct ctgctgttat ttactttacg 132 0 

agaggcccac cttatccaaa gagctgtatg ccctgggggt ctttgatggg cttcacacag 13 80 

tacatggcac ttactacatc caagtgtgtg ccctggtcag gtgtgggggt cttggcttcg 1440 

acacctgcgg acaggaaatc acagaggcca cggggatatt tgagtttcac ctgtggggca 15 00 

acttcagtac ttcctatatc tttcctttgt ttctgacctc agggatgacc ctagaagtcc 15 6 0 

ctgaccagct tggofcgggag aatgaccact atttcctgag gaaaagtagg ctzgtcctctg 162 0 

ggctggtgac ggcggctctc tatgggcgct tgtatgagag ggactaggaa aeigtgtgtgg 1680 

tctgtggggc ggactctggc catcatgttg acagccttgc acttccacag gctacaagcc 1740 

ctgggaccat ctttctgcct taagggcagg agcccacttc tgtggcacca gattccaccc 180 0 

tgggaactgt ggaaaaagta ggagaggcag attccctcag tgtcttcctc ttaaacctca 186 0 

atcatcgaga cattaggggg tattttctgt tcacatttat ctttttcaag ccacatcttc 192 0 

ctctaacaaa tctctcagta tgcgattggt ctcaagctaa aacaaaaata aatgtcagtt 1980 
tatattttac acatccaaaa aaaaaaaaaa aaaaaa 2 016 



<210> 38 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_00006O 



<400> 38 

tcctctaaca aatctctcag tatgcgattg gtctcaagct aaaacaaaaa taaatgtcag 60 



<210> 39 
<211> 811 
<212> DNA 
<213> Homo 



sapiens 
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<300> 

<308> NM_000269 



<400> 39 

gcagaagcgt tccgtgcgtg caagtgctgc gaaccacgtg ggtcccgggc gcgtttcggg 6 0 

tgctggcggc tgcagccgga gttcaaacct aagcagctgg aaggaaccat ggccaactgt 12 0 

gagcgtacct tcattgcgat caaaccagat ggggtccagc ggggtcttgt gg-gagagatt 180 

atcaagcgtt ttgagcagaa aggattccgc cttgttggtc tgaaattcat gcaagcttcc 240 

gaagatcttc tcaaggaaca ctacgttgac ctgaaggacc gtccattctt tgccggcctg 300 

gtgaaataca tgcactcagg gccggtagtt gccatggtct gggaggggct ga_atgtggtg 3 60 

aagacgggcc gagtcatgct cggggagacc aaccctgcag actccaagcc tgggaccatc 42 0 

cgtggagact tctgcataca agttggcagg aacattatac atggcagtga ttctgtggag 480 

agtgcagaga aggagatcgg cttgtggttt caccctgagg aactggtaga ttacacgagc 540 

tgtgctcaga actggatcta tgaatgacag gagggcagac cacattgctt ttcacatcca 6 00 

tttcccctcc ttcccatggg cagaggacca ggctgtagga aatctagtta tttacaggaa 660 

cttcatcata atttggaggg aagctcttgg agctgtgagt tctccctgta cagtgttacc 72 0 

atccccgacc atctgattaa aatgcttcct cccagcatag gattcattga gttggttact 780 
tcatattgtt gcattgcttt tttttccttc t 811 

<210> 40 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_000269 
<400> 40 

gtctgaaatt catgcaagct tccgaagatc ttctcaagga acactacgtt gacctgaagg 60 

<210> 41 
<211> 2338 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM„0 002 91 
<400> 41 

agcgcacgtc ggcagtcggc tccctcgttg accgaatcac cgacctctct ccccagctgt 60 

atttccaaaa tgtcgctttc taacaagctg acgctggaca agctggacgt taaagggaag 12 0 

cgggtcgtta tgagagtcga cttcaatgtt cctatgaaga acaaccagat aacaaacaac 180 

cagaggatta ag^ctgctgt cccaagcatc aaattctgct tggacaatgg agccaagtcg 240 

gtagtcctta tgagccacct aggccggcct gatggtgtgc ccatgcctga caagtactcc 3 00 

ttagagccag ttgctgtaga actcaaatct ctgctgggca aggatgttct gttcttgaag 3 60 

gactgtgtag gcccagaagt ggagaaagcc tgtgccaacc cagctgctgg gtctgtcatc 42 0 

ctgctggaga acctccgctt tcatgtggag gaagaaggga agggaaaaga t^cttctggg 480 

aacaaggtta aagccgagcc agccaaaata gaagctttcc gagcttcact ttccaagcta 540 

ggggatgtct atgtcaatga tgcttttggc actgctcaca gagcccacag ctccatggta 600 

ggagtcaatc tgccacagaa ggctggtggg tttttgatga agaaggagct gaactacttt 660 

gcaaaggcct tggagagccc agagcgaccc ttcctggcca tcctgggcgg agctaaagtt 72 0 

gcagacaaga tccagctcat caataatatg ctggacaaag tcaatgagat geittattggt 7 80 

ggtggaatgg cttttacctt ccttaaggtg ctcaacaaca tggagattgg cacttctctg 840 

tttgatgaag agggagccaa gattgtcaaa gacctaatgt ccaaagctga geiagaatggt 900 

gtgaagatta ccttgcctgt tgactttgtc actgctgaca agtttgatga gaatgccaag 9 60 

actggccaag ccactgtggc ttctggcata cctgctggct ggatgggctt g^actgtggt 102 0 

cctgaaagca gcaagaagta tgctgaggct gtcactcggg ctaagcagat tgtgtggaat 1080 

ggtcctgtgg gg-^tatttga atgggaagct tttgcccggg gaaccaaagc tctcatggat 1140 

gaggtggtga aagccacttc taggggctgc atcaccatca taggtggtgg agacactgcc 12 00 

acttgctgtg ccaaatggaa cacggaggat aaagtcagcc atgtgagcac tgggggtggt 12 60 

gccagtttgg agctcctgga aggtaaagtc cttcctgggg tggatgctct cagcaatatt 132 0 



21 



WO 2005/086891 



PCT/US2005/007894 



tagtactttc ctgcctttta gttcctgtgc acagccccta agtcaactta g-cattttctg 13 80 

catctccact tggcattagc taaaaccttc catgtcaaga ttcagctagt ggccaagaga 1440 

tgcagtgcca ggaaccctta aacagttgca cagcatctca gctcatcttc actgcaccct 1500 

ggatttgcat acattcttca agatcccatt tgaatttttt agtgactaaa ccattgtgca 1560 

ttctagagtg catatattta tattttgcct gttaaaaaga aagtgagcag tgttagctta 162 0 

gttctctttt gatg-taggtt attatgatta gctttgtcac tgtttcacta ctcagcatgg 1680 

aaacaagatg aaattccatt tgtaggtagt gagacaaaat tgatgatcca ttaagtaaac 1740 

aataaaagtg tccattgaaa ccgtgatttt tttttttttc ctgtcatact ttgttaggaa 1800 

gggtgagaat agaatcttga ggaacggatc agatgtctat attgctgaat g-caagaagtg 1860 

gggcagcagc agtggagaga tgggacaatt agataaatgt ccattcttta tcaagggcct 192 0 

actttatggc agacattgtg ctagtgcttt tattctaact tttattttta tcagttacac 19 80 

atgatcataa tttaaaaagt caaggcttat aacaaaaaag ccccagccca ttcctcccat 2 040 

tcaagattcc cactccccag aggtgaccac tttcaactct tgagtttttc aggtatatac 2100 

ctccatgttt ctaagtaata tgcttatatt gttcacttcc ttttttttta ttttttaaag 2160 

aaatctattt cataccatgg aggaaggctc tgttccacat atatttccac ttcttcattc 222 0 

tctcggtata gttttgtcac aattatagat tagatcaaaa gtctacataa ctaatacagc 22 80 
tgagctatgt agteitgctat gattaaattt acttatgtaa aaaaaaaaaa aaaaaaaa 233 8 



<210> 42 
<211> 60 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM„00029 1 



<400> 42 

acttagcatt ttctgcatct ccacttggca ttagctaaaa ccttccatgt caagattcag 60 



<210> 43 
<211> 787 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM__0003 6 3 
<400> 43 

ctgaaggtca cccgggcggc cccctcactg 
ctcctgccat tcccggcctg agtctcagca 
aacctcgccc tgcaccagcc ccaatcagac 
cggagccgca cgccaagaaa aaatctaaga 
ctctgctgct gcagattgca aagcaagagc 
agaaggggcg cgctctgagc acccgctgcc 
cggagctgca ggctcttgtgc cgacagctcc 
gatacgacat agaggcaaaa gtcaccaaga 
agatctttga ccttcgaggc aagtttaagc 
cagatgccat gatgcaggcg ctgctggggg 
cccacctcaa gcaggtgaag aaggaggaca 
ggcggaagaa catcgatgca ctgagtggaa 
gagccttcct gcctactgcc cctgccctga 
tgagctg 787 



accctccaaa cgcccctgtc otcgccctgc 60 

tggcggatgg gagcagcgat ^cggctaggg 12 0 

gccgctcctc caactaccgc gcttatgcca 180 

tctccgcctc gagaaaattg cagctgaaga 240 

tggagcgaga ggcggaggag cggcgcggag 3 00 

agccgctgga gttgaccggg ctgggcttcg 3 60 

acgcccgtgt ggacaaggtg gatgaagaga 42 0 

acatcacgga gattgcagat ctgactcaga 480 

ggcccaccct gcggagagtg a.ggatctctg 540 

cccgggctaa ggagtccctg g-acctgcggg 600 

ccgagaagga aaaccgggag gtgggagact 660 

tggagggccg caagaaaaag tttgagagct 72 0 

ggagggccac tgaggaataa agcttctctc 780 



<210> 44 
<211> 60 
<212> DNA 
<213> Homo 



sapiens 



<300> 

<308> NM_000363 
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<400> 44 

tgtggacaag gtggatgaag agagatacga catagaggca aaagt caeca agaacatcac 6 0 

<210> 45 

<211> 1263 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> INTM_0003 65 
<400> 45 

ggcacgagac cttcagcgcc tcggctccag cgccatggcg ccctccagga aigttcttcgt 6 0 

tgggggsiaac tggaagatga aegggeggaa gcagagtctg ggggagctca tcggcactct 12 0 

gaacgcggcc aaggtgccgg ccgacaccga ggtggtttgt gctcccccta ctgectatat 180 

cgacttcgcc eggcagaage tagatcccaa gattgctgtg getgegcaga actgetacaa 240 

agtgactaat ggggctttta ctggggagat cagccctggc atgatcaaag actgeggage 3 00 

cacgtg^gtg gtcctggggc actcagagag aaggcatgtc tttggggagt cagatgagct 3 60 

gattgg^cag aaagtggccc atgctctggc agagggactc ggagtaatccj ectgeattgg 42 0 

ggagaagcta gatgaaaggg aagctggcat cactgagaag gttgttttcg agcagacaaa 480 

ggtcatcgea gataacgtga aggactggag caaggtegtc ctggcctato; cagcctgtgtg 540 

ggccattggt actggcaaga ctgcaacacc ccaacaggcc caggaagtac acgagaagct 600 

ccgaggsLtgg ctgaagtcca aegtctctga tgcggtggct cagagcaccc gtatcattta 660 

tggaggctct gtgactgggg caacctgcaa ggagctggcc agccagcctg sitgtggatgg 72 0 

cttccttgtg ggtggtgctt ccctcaagcc egaattegtg gacatcatca atgecaaaca 780 

atgagcccca tccatcttcc ctacccttcc tgccaagcca gggactaagc ageccagaag 840 

cccagtaact gccctttccc tgeatatget tctgatggtg tcatctgctc ottcctgtgg 900 

cctcatccaa actgtatctt cctttactgt ttatatcttc accctgtaat ^gttgggacc 960 

aggccaatcc cttctccact tactataatg gttggaacta aacgtcacca s.ggtggcttc 102 0 

tccttg-g-ctg agagatggaa ggcgtggtgg gatttgetec tgggttccct aggecctagt 1080 

gagggcagaa gagaaaccat cctctccctt cttacaccgt gaggecaaga tcccctcaga 1140 

aggcag-Q-agt gctgccctct cccatggtgc ccgtgcctct gtgctgtgta tgtgaaccac 1200 

ccatgt^agg gaataaacct ggcactagga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 12 60 
aaa 3_ 2 6 3 

<210> 46 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> 3NrM_000365 
<400> 46 

tatcttcacc ctgtaatggt tgggaccagg ccaatccctt ctccacttac 

<210> 47 

<211> 1616 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_000582 
<400> 47 

ctccctgtgt tggtggagga tgtctgeage agcatttaaa ttctgggagg 
cagcagcagc aggaggaggc agagcacagc ategteggga ccagactcgt 
ttgcacjcctt ctcagccaaa cgccgaccaa ggaaaactca ctaccatgag 
atttgotttt gcctcctagg catcacctgt gccataccag ttaaacaggc 
agttctgagg aaaagcagct ttacaacaaa tacccagatg ctgtggccac 



tataatggtt 60 



gcttggttgt 60 

ctcaggccag 12 0 

aattgcagtg 180 

tgattctgga 240 

atggctaaac 3 00 
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cctgacccat 
gaaagccatg 
caggactcca 
tctgatgagt 
ctgccagcaa 
ggtgatagtg 
cagtaccctg 
ggtgcataca 
cgtgggaagg 
cacaagcagt 
gtgattgata 
agccatgaag 
tttcgtattt 
tacaatttct 
aacatgaaat 
taactaatgt 
aaagcttcag 
ttaatatttg 
cccacttaaa 
tgtatatttt 
taagaatttg 
aacataacct 



ctcagaagca 
accacatgga 
ttgactcgaa 
ctcaccattc 
ccgaagtttt 
tggtttatgg 
atgctacaga 
aggccatccc 
acagttatga 
ccagattata 
gtcaggaact 
atatgctggt 
ctcatgaatt 
cactttgcat 
gcttctttct 
gtttgataat 
ggttatgtct 
ttattctctc 
aagagaatat 
gttgtgatta 
gtggtgtcaa 
tttttactgc 



gaatctccta 
tgatatggat 
cgactctgat 
tgatgaatct 
cactccagtt 
actgaggtca 
cgaggacatc 
cgttgcccag 
aacgagtcag 
taagcggaaa 
ttccaaagtc 
tgtagacccc 
agatagtgca 
ttagtcaaaa 
cagtttattg 
tagtttagtt 
atgttcattc 
atgaatagaa 
aacattttat 
tctttttgtg 
ttgcttattt 
ctaaaaaaaa 



gccccacaga 
gatgaagatg 
gatgtagatg 
gatgaactgg 
gtccccacag 
aaatctaaga 
acctcacaca 
gacctgaacg 
ctggatgacc 
gccaatgatg 
agccgtgaat 
aaaagtaagg 
tcttctgagg 
gaaaaaatgc 
gttgaatgtg 
tgtggcttca 
tatagaagaa 
atttatgtag 
gtcactataa 
gtgtgaataa 
gttttcccac 
aaaaaaaaaa 



cccttccaag 
atgatgacca 
acactgatga 
tcactgattt 
tagacacata 
agtttcgcag 
tggaaagcga 
cgccttctga 
agagtgctga 
agagcaatga 
tccacagcca 
aagaagataa 
tcaattaaaa 
tttatagcaa 
tatctatttg 
tggaaactcc 
atgcaaacta 
aagcaaacaa 
tcttttgttt 
atcttttatc 
ggttgtccag 
aaaaaaaaaa 



taagtccaacz: 3 60 

tgtggacagcz: 42 0 

ttctcaccag- 480 

tcccacgg-acz: 540 

tgatggccgsL 6 00 

acctgacatc 660 

ggagttgaafc 72 0 

ttgggacagc 7 80 

aacccacagc 840 

gcattccg-at: 9 00 

tgaatttcacz: 96 0 

acacctgaaa 102 0 

ggagaaaaaa 1080 

aatgaaag-acg 1140 

agtctggaaa. 12 00 

ctgtaaacta 12 6 0 

tcactgtatt 1320 

aatactttta. 13 80 

tttaagttag 1440 

ttgaatgtasi 1500 

caattaataa. 15 6 0 
aaaaaa 1^16 



<210> 48 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0 O0582 



<400> 48 

ggtggtgtca. attgcttatt tgttttccca cggttgtcca gcaattaata aaacataacc 60 



<210> 49 
<211> 1666 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0 00584 



<400> 49 

ctccataagg cacaaacttt cagagacagc 
ggaagaaacc accggaagga accatctcac 
tggctctctt ggcagccttc ctgatttctg 
ggagtgctaa agaacttaga tgtcagtgca 
aatttatcaa agaactgaga gtgattgaga 
ttgtaaagct ttctgatgga agagagctct 
gggttgtgga gaagtttttg aagagggctg 
tatccaagaa tcagtgaaga tgccagtgaa 
gtattgtgtg- ggtctgttgt agggttgcca 
aatttcagta. aacaatgaat agtttttcat 
atgtaaagta. ttatttattt gaatctacaa 
ttttcctaga. tattgcacgg gagaatatac 
gaatatccga. actttaattt caggaattga 
atcacataaa aatgatggga caataaattt 
tggatttttt tctgttaaat ctggcaaccc 
gttccactgt gccttggttt ctcctttatt 
tacctcacag tgatgttgtg aggacatgtg 
aaattatttt caagtgtaac ttattaacct 



agagcacaca agcttctagg acaagagcca 60 

tgtgtgtaaa catgacttcc aagctggcc^ 12 0 

cagctctgtg tgaaggtgca gttttgcca_a 180 

taaagacata ctccaaacct ttccacccca 240 

gtggaccaca ctgcgccaac acagaaatta 3 00 

gtctggaccc caaggaaaac tgggtgcag-«a. 3 60 

agaattcata aaaaaattca ttctctgtg-g 42 0 

acttcaagca aatctacttc aacacttca_t: 480 

gatgcaatac aagattcctg gttaaattt-cj 540 

tgtaccatga aatatccaga acatactta_t 600 

aaaacaacaa ataattttta aatataagg-a. 660 

aaatagcaaa attgaggcca agggccaagra. 72 0 

atgggtttgc tagaatgtga tatttgaag-c 780 

tgccataaag tcaaatttag ctggaaatcc 840 

tagtctgcta gccaggatcc acaagtcctt 9 00 

tctaagtgga aaaagtatta gccaccatct 960 

gaagcacttt aagttttttc atcataaca_t 102 0 

atttattatt tatgtattta tttaagcatc 1080 
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aaatatttg-t gcaagaattt ggaaaaatag aagatgaatc attgattgaa tagttataaa 1140 

gatgttatao; taaatttatt ttattttaga tattaaatga tgttttatta gataaatttc 12 00 

aatcagggtt tttagattaa acaaacaaac aattgggtac ccagttaaat tttcatttca 12 60 

gataaacaac aaataatttt ttagtataag tacattattg tttatctgaa attttaattg 132 0 

aactaacaat cctagtttga tactcccagt cttgtcattg ccagcto;tgt tggtagtgct 13 80 

gtgttgaatt acggaataat gagttagaac tattaaaaca gccaaaactc cacagtcaat 1440 

attagtaatt tcttgctggt tgaaacttgt ttattatgta caaatagatt cttataatat 15 00 

tatttaaatg actgcatttt taaatacaag gctttatatt tttaacttta agatgttttt 1560 

atgtgctctc caaatttttt ttactgtttc tgattgtatg gaaatataaa agtaaatatg 162 0 

aaacatttaa aatataattt gttgtcaaag taaaaaaaaa aaaaaa 16 66 



<210> 50 
<211> 60 
<212> DNA. 

<213> Homo sapiens 
<300> 

<308> NM_000584 



<400> 50 

tggtagtgct gtgttgaatt acggaataat gagttagaac tattaaaaca gccaaaactc 60 



<210> 51 

<211> 172 2 

<212> DNA. 

<213> Homo sapiens 



<300> 

<308> NM_000599 



<400> 51 

ggggaaaacja 

ttttcattac 

tcgaagagcja 

ttgtttttg-t 

aacgcctctt 

ggtgtattt t 

caacgccatc 

tttgtggctt 

ccgctcactg 

tcttgcaact 

aagtgtctg-g 

aaaaaaatct 

accccgagcjt 

gctggccgcc 

cgacgagaaa 

gccgggctcjc 

caccgagccj c 

gcacgccctg 

caagatcgag 

gacctactcc 

agcagtgaaa-g 

gaacactgcc 

cccctgcccjc 

gccccgtgct 

caaaccttcc 

gctgccagcjc 

cgttgagtcja 

tccagccagc 

atatatatct 



gctaggaaag 
ccctcctccg 
ggtggcaaag 
tttttaattt 
acctgttctg 
agattttaag 
tccactgcat 
tttttcccct 
cgtgcacctg 
gggacctgct 
gggagggcac 
ccgggggccc 
aaagggggcg 
tatgcggggc 
gccctctcca 
ggctgctgca 
tgcgcccagg 
ctgcacggcc 
agagactccc 
cccaagatct 
aaggaccgca 
cacccccgga 
agacacatgg 
gtgtacctgc 
cgtggccgca 
atggagtacg 
tgcgtccccc 
gcctccctcc 
atctatttga 



agctgcaaag 
ttttcaccct 
tgggagaaaa 
cttgatttca 
cggcgccgcg 
caaaaatttt 
ccgatctcat 
ataattctga 
gcgctgccct 
cgcaggcacc 
ctgctctacc 
tcttggcccc 
actaagagaa 
cggcccagag 
tgtgcccccc 
tgacctgcgc 
ggctgcgctg 
gcggggtttg 
gtgagcacga 
tccggcccaa 
gaaagaagct 
tcatctctgc 
aggcttccct 
ccaattgtga 
agcgtggcat 
ttgacgggga 
cccaaccttt 
accccaggac 
ggaaaaaaaa 



cagtgtgggc 
tctccggact 
gaggtgttag 
acattttctc 
caccgctggc 
aaagataaat 
tatttcggtg 
cccgctcagg 
gcttccccca 
ccagccctcc 
tgccagaaat 
tttatccctg 
gatggtgttg 
cctgggctcc 
cagccccctg 
cctggccgag 
cctcccccgg 
cctcaacgaa 
ggagcccacc 
acacacccgc 
gacccagtcc 
acctgagatg 
gcaggagctc 
ccgcaaagga 
ctgctggtgc 
ctttcagtgc 
ccctcacccc 
gccactcatt 
aaaaaaaaaa 



tttttccctt 
tcgcgtagaa 
ggtttcrgggt 
ccaccctctc 
agctga_gggt 
ccatttttct 
gttgcfc tggg 
cttgagrggtt 
acctgttgca 
acctctctct 
tttaaaacaa 
cactctcgct 
ctcaccgcgg 
ttcgtcjcact 
ggctgcgagc 
gggcagj-tcgt 
caggacgagg 
aagagctacc 
acctctgaga 
atctccgagc 
aagtttgtcg 
agacagj-gagt 
aaagccagcc 
ttctacaaga 
g tggac aagt 
cacaccttcg 
ctcccacccc 
tcatctcatt 
aa 1722 



tttttgctcc 
cctgcgaatt 
ttttttgttt 
ggctgcagcc 
tagaaagcgg 
ctcccacccc 
ggtgaacaat 
tctccggcct 
aggctttaat 
acatttttgc 
aaacaaaaac 
ctcctgcccc 
tcctcctgct 
gcgagccctg 
tggtcaagga 
gcggcgtcta 
agaagccgct 
gcgagcaagt 
tggccgagga 
tgaaggctga 

ggggagccga 
ctgagcaggg 
cacgcatggt 
gaaagcagtg 
acgggatgaa 
acagcagcaa 
cagccccgac 
taagggaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 
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<210> 52 

<211> SO 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> 1NTM_000599 
<400> 52 

ccaggacgcc actcatttca tctcatttaa gggaaaaata tatatctatc tatttgagga 60 

<210> 53 

<211> 704 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> !NM_000735 
<400> 53 

gcagttactg agaactcata agacgaagct aaaatccctc ttcggatcca cagtcaaccg 60 

ccctgaacac atcctgcaaa aagcccagag aaaggagcgc catggattac tacagaaaat 12 0 

atgcagctat ctttctggtc acattgtcgg tgtttctgca tgttctccat tccgctcctg 180 

atgtgcagga ttgcccagaa tgcacgctac aggaaaaccc attcttctcc cagccgggtg 240 

ccccaatact tcagtgcatg ggctgctgct tctctagagc atatcccact ccactaaggt 3 00 

ccaagaagac gatgttggtc caaaagaacg tcacctcaga gtccacttgc tgtgtagcta 3 60 

aatcatataa cagggtcaca gtaatggggg gtttcaaagt ggagaaccac acggcgtgcc 42 0 

actgcacrtac ttgttattat cacaaatctt aaatgtttta ccaagtgctg tcttgatgac 480 

tgctgatttt ctggaatgga aaattaagtt gtttagtgtt tatg^ctttg tgagataaaa 540 

ctctcctttt ccttaccata ccactttgac acgcttcaag gatatactgc agctttactg 600 

ccttcctcct tatcctacag tacaatcagc agtctagttc ttttcatttg gaatgaatac 660 

agcattaagc ttgttccact gcaaataaag ccttttaaat catc 7 04 

<210> 54 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> !NM__00073 5 
<400> 54 

tgagataaaa ctctcctttt ccttaccata ccactttgac accjcttcaag gatatactgc 6 0 

<210> 55 

<211> 1342 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> !NM_000799 
<400> 55 

cccggagccg gaccggggcc accgcgcccg ctctgctccg acaccgcgcc ccctggacag 60 

ccgccctctc ctccaggccc gtggggctgg ccctgcaccg ccgagcttcc cgggatgagg 12 0 

gccccccjgtg tggtcacccg gcgcgcccca ggtcgctgag ggaccccggc caggcgcgga 180 

gatgggrcjgtg cacgaatgtc ctgcctggct gtggcttctc ctgtccctgc tgtcgctccc 240 

tctggg-cctc ccagtcctgg gcgccccacc acgcctcatc tgtgacagcc gagtcctgga 3 00 

gaggtacctc ttggaggcca aggaggccga gaatatcacg accjg^ctgtg ctgaacactg 3 60 

cagcttcjaat gagaatatca ctgtcccaga caccaaagtt aatttictatg cctggaagag 42 0 
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gatgg-aggtc 
tgtcctgcgg 
gcatcg-tggat 
agcccagaag 
cactgctgac 
gaagctgtac 
ggcatatcca 
gaaccccgtc 
gcaatgacat 
tcacagggcc 
ggacagagcc 
aggacacgct 
tggacyaactt 
ggtggcaaga 
gcctctggct 
aaaccaccaa 



gggcagcagg 
ggccaggccc 
aaagccgtca 
gaagccatct 
actttccgca 
acaggggagg 
ccacctccct 
gaggggctct 
ctcaggggcc 
aacttgaggg 
atgctgggaa 
ttggaggcga 
aggtggcaag 
gcccccttga 
ctcatggggt 
aaaaaaaaaa 



ccgtagaagt 
tgttggtcaa 
gtggccttcg 
cccctccaga 
aactcttccg 
cctgcaggac 
caccaacatt 
cagctcagcg 
agaggaactg 
cccagagcag 
gacgcctgag 
tttacctgtt 
ctgtgacttc 
caccggggtg 
ccaagttttg 
aa 1342 



ctggcagggc 
ctcttcccag 
cagcctcacc 
tgcggcctca 
agtctactcc 
aggggacaga 
gcttgtgcca 
ccagcctgtc 
tccagagagc 
gaagcattca 
ctcactcggc 
ttcgcaccta 
tccaggtctc 
gtgggaacca 
tgtattcttc 



ctggccctgc 
ccgtgggagc 
actctgcttc 
gctgctccac 
aatttcctcc 
tgaccaggtg 
caccctcccc 
ccatggacac 
aactctgaga 
gagagcagct 
accctgcaaa 
ccatcaggga 
acgggcatgg 
tgaagacagg 
aacctcattg 



tgtcggaagc 
ccctgcagct 
gggctctgcg 
tccgaacaat 

ggggaaagct 

tgtccacctg 
cgccactcct 
tccagtgcca 
tctaaggatg 
ttaaactcag 
atttgatgcc 
caggatgacc 
gcactccctt 
atgggggctg 
acaagaactg 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 



<210> 56 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJ3007 99 



<400> 56 

tcatg-gggtc caagttttgt gtattcttca acctcattga caagaactga aaccaccaaa 60 

<210> 57 

<211> 2722 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_000917 



<400> 57 

gagcg-ggctg agggtaggaa gtagccgctc cgagtggagg cgactggggg ctgaagagcg 60 

cgccgccctc tcgtcccact ttccaggtgt gtgatcctgt aaaattaaat cttccaagat 12 0 

gatctggtat atattaatta taggaattct gcttccccag tctttggctc atccaggctt 180 

ttttacttca attggtcaga tgactgattt gatccatact gagaaagatc tggtgacttc 240 

tctgaaagat tatattaagg cagaagagga caagttagaa caaataaaaa aatgggcaga 3 00 

gaagttagat cggctaacta gtacagcgac aaaagatcca gaag-gatttg ttgggcatcc 3 60 

agtaaatgca ttcaaattaa tgaaacgtct gaatactgag tggeigtgagt tggagaatct 42 0 

ggtccttaag gatatgtcag atggctttat ctctaaccta accattcaga gaccagtact 480 

ttctaatgat gaagatcagg ttggggcagc caaagctctg ttacgtctcc aggataccta 540 

caat-fctggat acagatacca tctcaaaggg taatcttcca ggaQtgaaac acaaatcttt 600 

tctaacggct gaggactgct ttgagttggg caaagtggcc tatacagaag cagattatta 660 

ccatacggaa ctgtggatgg aacaagccct aaggcaactg gatgaaggcg agatttctac 72 0 

catagataaa gtctctgttc tagattattt gagctatgcg gtatatcagc agggagacct 780 

ggataaggca cttttgctca caaagaagct tcttgaacta gatcctgaac atcagagagc 840 

taatggtaac ttaaaatatt ttgagtatat aatggctaaa gaaaaagatg tcaataagtc 900 

tgcttcagat gaccaatctg atcagaaaac tacaccaaag aaaaaagggg ttgctgtgga 9 60 

ttacctgcca gagagacaga agtacgaaat gctgtgccgt gg-g^agggta tcaaaatgac 102 0 

ccctcggaga cagaaaaaac tcttttgccg ctaccatgat ggaaaccgta atcctaaatt 1080 

tattctggct ccagctaaac aggaggatga atgggacaag cctcgtatta ttcgcttcca 1140 

tgatcLttatt tctgatgcag aaattgaaat cgtcaaagac ctagcaaaac caaggctgag 12 00 

ccgagctaca gtacatgacc ctgagactgg aaaattgacc acagcacagt acagagtatc 12 60 

taag-a.gtgcc tggctctctg gctatgaaaa tcctgtggtg tctcgaatta atatgagaat 132 0 

acaagatcta acaggactag atgtttccac agcagaggaa ttacaggtag caaattatgg 13 80 

agttggagga cagtatgaac cccattttga ctttgcacgg aaagatgagc cagatgcttt 1440 
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caaagagctg 
tgcaggagga 
tgctgttttc 
agcctgtcca 
acaagaattt 
ttctcctatt 
ggag"tttaca 
catctgtgga 
tgcataaaac 
aattttatat 
gtatttcact 
aaatgcctta 

atctgaagac 
tgatcttcta 
taccaacccc 
aatatttcga 
ttaactaaaa 
ttttgttcag 
cagcagggtg 
aaaaatgtac 
agaaagccaa 
tttatgatct 



gggacaggaa 
gccactgttt 
tggtataatc 
gtgctagttg 
cgaagacctt 
gttgtactct 
attgactaac 
caattgctta 
cttaaagttc 
ttaaagaact 
acatctcagt 
cagatgtgcc 

tagtaataaa 
agttgatttt 
ctctcttttc 
tttttttttt 
tgctcaccac 
gaattttgct 
gtaatattgg 
aagtcctttc 
aatgtttgtt 
taaatctttt 



atagaattgc 
ttcctgaagt 
tgtttgccag 
gcaacaaatg 
gtacgttgtc 
tatgtgtctg 
actccatgat 
ctttgtgggt 
tgttggtatc 
ttttggttgg 
tggtgggtgt 
taggtgttct 

ctaggacact 
tttcctccca 
ttttagctcc 
ttttttaatg 
ttagcctgtt 
gtttttctta 
cattctgatt 
agtgatgagg 
tttttaagat 
tt 2722 



tacatggctg 
tggagctagt 
tggagaagga 
ggtatccaat 
agaattggaa 
atatacacat 
tgattcagtc 
tcttttaaaa 
acagaagaca 
ataaaaacat 
taagctagaa 
gtttacctag 

aactgggttc 
agtctttttt 
tctgtggtga 
gaaagtcctg 
tttttatccc 
gtgctaatac 
aaatactgtg 
gaattgattt 
tctgaaatgt 



ttttatatga 
gtttggccca 
gattatagta 
aaatggctcc 
tgacaaacag 
ttccatagtc 
atgaacctca 
gtaacacgaa 
acg-gcagagtt 
aatttgagca 
tg-ggctgtgt 
tgtcttactc 

catgtgattg 
aaagaaagta 
attaaacgta 
cataacaaca 
ttttttaaaa 
cttgcctctt 
ccttaggaga 
tttttaaaag 
gttgtgacaa 



gtgatgtgtc 
aaaaaggaac 
cacggcatgc 
atgaacgtgg 
gcttcccttt 
ttaactttca 
tcccatgttt 
atcatcatat 
taaagtgagg 
tccagtttta 
gataggaaac 
tgttttctgg 

ccctttcata 
tactgtattt 
cttgagttaa 
ctgggccttc 
tgacagatga 
attcctgcta 
ctggaagttt 
tctttttctt 
caatgaccta 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 

2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 



<210> 58 

<2in_> 60 

<212> DNA 

<213> Homo sapiens 

<30O> 

<308> NM_000917 



<40O> 58 

tcttactctg ttttctggat ctgaagacta gtaataaact aggacactaa ctgggttcca 60 



<210> 59 

<211> 3236 

<212> DNA 

<213> Homo sapiens 

<30O> 

<308> NM_001109 



<40O> 59 

gacccggcca 

gcccccagcc 

ccaggccccc 

agctacgtcc 

ctgctgggtt 

cagcctcgcg 

gccgccagcc 

cacctgatcg 

gagcacctgc 

ctgggacccc 

gagacccgct 

agcgaagcag 

cagaaactca 

ttccacgtca 

caacggacac 

gggsictactg 

aaccaggacc 



tgcgcggcct 
ggccctgggc 
gagtccgccg 
ttggggccac 
ccggctacac 
ggcaggacca 
tcagcacctg 
agcccctgga 
tgcagacggc 
ggacggcagc 
acgtggagct 
ccgtgcgtca 
acttccgtgt 
gccccgaccc 
ggcggcacct 
tggggtttgc 
acagcaagaa 



cgggctctgg 
cctcatggag 
agctctgccc 
agggcacaac 
agagacctat 
ctgcttatac 
tgccggcctc 
tgaaggtggc 
cgggacctgc 
cgtcttcagg 
gtatgtggtc 
tcgggtgctg 
ggtcctggtg 
cagtgtcaca 
gcatgacaac 
cagggtgtcc 
ccccgtgggc 



ctgctgggcg 
cagtatgagg 
tcccacttgg 
ttcaccctcc 
acggctgcca 
cagggccacg 
aggggtttct 
gagggcggac 

ggggtcagcg 
cctcggcccg 
gtggacaatg 
gaggtggtga 
ggcctggaga 
ctggagaacc 
gtacagctca 
gccatgtgct 
gtggcctgca 



cgatgatgct 
tcgtgttgcc 
gcctgcaccc 
acctgcggaa 
a-fcggctccga 
tagaggggta 
tccaggtggg 
ggcacgccgt 
acgacagcct 
g^gactctct 
cagagttcca 
atcacgtgga 
t -fctggaatag 
tcctgacctg 
tcacgggtgt 
cccacagctc 
ccatggccca 



gcctgcgatt 
gcggcgtctg 
agagagggtg 
gaacagggac 
ggtgacggag 
cccggactca 
gtcagacctg 
gtaccaggct 
gggcagcctc 
gccatcccga 
gatgctgggg 
caagctatat 
tcaggacagg 
gcaggcacgg 
cgacttcacc 
aggggctgtg 
tgagatgggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 
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cacaacctgg gcatggacca tgatgagaac gtccagggct gccgctgcca ggaacgcttc 1080 

gaggccggcc gctgcatcat ggcaggcagc attggctcca gtttccccag gatgttcagt 1140 

gactgcagcc aggcctacct ggagagcttt ttggagcggc cgcagtcggt gtgcctcgcc 12 00 

aacgcccctg acctcagcca cctggtgggc ggccccgtgt gtgggaacct gtttgtggag 12 60 

cgtg^ggagc agtgcgactg cggccccccc gaggactgcc ggaaccgctg ctgcaactct 132 0 

accacctgcc agctggctga gggggcccag tgtgcgcacg gtacctgctg ccaggagtgc 13 80 

aaggtgaagc cggctggtga gctgtgccgt cccaagaagg acatgtgtga cctcgaggag 1440 

ttct^tgacg gccggcaccc tgagtgcccg gaagacgcct tccaggagaa cggcacgccc 1500 

tgctccgggg gctactgcta caacggggcc tgtcccacac tggcccagca gtgccaggcc 15 60 

ttctgggggc caggtgggca ggctgccgag gagtcctgct tctcctatga catcctacca 162 0 

ggctgcaagg ccagccggta cagggctgac atgtgtggcg ttctgcagtg caagggtggg 1680 

cagcagcccc tggggcgtgc catctgcatc gtggatgtgt gccacgcgct caccacagag 1740 

gatggcactg cgtatgaacc agtgcccgag ggcacccggt gtggaccaga gaaggtttgc 1800 

tggaaaggac gttgccagga cttacacgtt tacagatcca gcaactgctc tgcccagtgc 1860 

cacaaccatg gggtgtgcaa ccacaagcag gagtgccact gccacgcggg ctgggccccg 192 0 

ccccactgcg cgaagctgct gactgaggtg cacgcagcgt ccgggagcct ccccgtcctc 19 80 

gtgg-tggtgg ttctggtgct cctggcagtt gtgctggtca ccctggcagg catcatcgtc 2040 

taccgcaaag cccggagccg catcctgagc aggaacgtgg ctcccaagac cacaatgggg 2100 

cgctccaacc ccctgttcca ccaggctgcc agccgcgtgc cggccaaggg cggggctcca 216 0 

gccccatcca ggggccccca agagctggtc cccaccaccc acccgggcca gcccgcccga 222 0 

cacccggcct cctcggtggc tctgaagagg ccgccccctg ctcctccggt cactgtgtcc 2280 

agcccaccct tcccagttcc tgtctacacc cggcaggcac caaagcaggt catcaagcca 23 40 

acgttcgcac ccccagtgcc cccagtcaaa cccggggctg gtgcggccaa ccctggtcca 2400 

gctg-oigggtg ctgttggccc aaaggttgcc ctgaagcccc ccatccagag gaagcaagga 2 46 0 

gccggagctc ccacagcacc ctaggggggc acctgcgcct g-tgtggaaat ttggagaagt 252 0 

tgcg-cycagag aagccatgcg ttccagcctt ccacggtcca gctagtgccg ctcagcccta 2580 

gaccctgact ttgcaggctc agctgctgtt ctaacctcag taatgcatct acctgagagg 2 640 

ctcctgctgt ccacgccctc agccaattcc ttctccccgc cttggccacg tgtagcccca 2700 

gctgtctgca ggcaccaggc tgggatgagc tgtgtgcttg c^ggtgcgtg tgtgtgtacg 27 60 

tgtctccagg tggccgctgg tctcccgctg tgttcaggag gccacatata cagcccctcc 2 82 0 

cagccacacc tgcccctgct ctggggcctg ctgagccggc tgccctgggc acccggttcc 2880 

aggcagcaca gacgtggggc atccccagaa agactccatc ccaggaccag gttcccctcc 2940 

gtgctcttcg agagggtgtc agtgagcaga ctgcacccca agctcccgac tccaggtccc 3 000 

ctga_tcttgg gcctgtttcc catgggattc aagagggaca gccccagctt tgtgtgtgtt 3060 

taagcttagg aatgcccttt atggaaaggg ctatgtggga gagtcagcta tcttgtctgg 312 0 

ttttcttgag acctcagatg tgtgttcagc agggctgaaa gcttttattc tttaataatg 3180 

agaaatgtat attttactaa taaattattg accgagttct g-tagattctt gttaga 323 6 

<210> 60 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<30O> 

<308> NMJ301109 
<40O> 60 

ctttatggaa agggctatgt gggagagtca gctatcttgt ctggttttct tgagacctca 60 

<210> 61 

<210_> 1449 

<212> DNA 

<213> Homo sapiens 

<30O> 

<308> NMJD01124 
<40O> 61 

ctggatagaa cagctcaagc cttgccactt cgggcttctc actgcagctg ggcttggact 60 

tcgg-eigtttt gccattgcca gtgggacgtc tgagactttc tccttcaagt acttggcaga 12 0 

tcactctctt agcagggtct gcgcttcgca gccgggatga a_^ctggtttc cgtcgccctg 180 
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atgtacctgg gttcgctcgc cttcctaggc gctgacaccg ctcggttgga tgtcgcgtcg 240 

gagtttcgaa agaagtggaa taagtgggct ctgagtcgtg ggaagaggga actgcggatg 3 00 

tccagcagct accccaccgg gctcgctgac gtgaaggccg ggcctgccca gacccttatt 3 60 

cggccccagg acatgaaggg tgcctctcga agccccgaag acagcagtcc ggatgccgcc 42 0 

cgcatccgag tcaagcgcta ccgccagagc atgaacaact tccagggcct ccggagcttt 480 

ggctgccgct tcgggacgtg cacggtgcag aagctggcac accagatcta ccagttcaca 540 

gataaggaca aggacaacgt cgcccccagg agcaagatca gcccccaggg ctacggccgc 600 

cggcgccggc gctccctgcc cgaggccggc ccgggtcgga ctctggtgtc ttctaagcca 660 

caagcacacg gggctccagc ccccccgagt ggaagtgctc cccactttct ttaggattta 72 0 

ggcgcccatg gtacaaggaa tagtcgcgca agcatcccgc tggtgcctcc cgggacgaag 7 80 

gacttcccga gcggtgtggg gaccgggctc tgacagccct gcggagaccc tgagtccggg 840 

aggcaccgtc cggcggcgag ctctggcttt gcaagggccc ctccttctgg gggcttcgct 9 00 

tccttagcct tgctcaggtg caagtgcccc agggggcggg gtgcagaaga atccgagtgt 9 60 

ttgccaggct taaggagagg agaaactgag aaatgaatgc tgagaccccc ggagcagggg 102 0 

tctgagccac agccgtgctc gcccacaaac tgatttctca cggcgtgtca ccccaccagg 1080 

gcgcaagcct cactattact tgaactttcc aaaacctaaa gaggaaaagt gcaatgcgtg 1140 

ttgtacatac agaggtaact atcaatattt aagtttgttg ctgtcaagat tttttttgta 12 00 

acttcaaata tagagatatt tttgtacgtt atatattgta ttaagggcat tttaaaagca 1260 

attatattgt cctcccctat tttaagacgt gaatgtctca gcgaggtgta aagttgttcg 132 0 

ccgcgtggaa tgtgagtgtg tttgtgtgca tgaaagagaa agactgatta cctcctgtgt 13 80 

ggaagaagga aacaccgagt ctctgtataa tctatttaca taaaatgggt gatatgcgaa 1440 
cagcaaacc 1449 

<210> 62 

<211> 60 

<2 12> DNA 

<2 13> Homo sapiens 

<300> 

<308> NM_001124 
<^00> 62 

gaaggaaaca ccgagtctct gtataatcta tttacataaa <atgggtgata tgcgaacagc 60 

<210> 63 
<211> 1619 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_001168 
<400> 63 

ccgccagatt tgaatcgcgg gacccgttgg cagaggtggc 
gacgttgccc cctgcctggc agccctttct caaggaccac 
ctggcccttc ttggagggct gcgcctgcac cccggagcgg 
ccactgcccc actgagaacg agccagactt ggcccagtgt 
g^aaggctgg gagccagatg acgaccccat agaggaacat 
cgctttcctt tctgtcaaga agcagtttga agaattaacc 
ggacagagaa agagccaaga acaaaattgc aaaggaaacc 
tgaggaaact gcgaagaaag tgcgccgtgc catcgagcag 
cctctggccg gagctgcctg gtcccagagt ggctgcacca 
gtgccaccag ccttcctgtg ggccccttag caatgtctta 
caaattagat gtttcaactg tgctcctgtt ttgtcttgaa 
tgcctgtgca gcgggtgctg ctggtaacag tggctgcttc 
gg-gggctcat ttttgctgtt ttgattcccg ggcttaccag 
a.<aggcagtgt cccttttgct agagctgaca gctttgttcg 
gtgaatgtgt ctggacctca tgttgttgag gctgtcacag 
g-gtgcctgtt gaatctgagc tgcaggttcc ttatctgtca 
acagtttttt tgttgttgtg tttttttgtt tttttttttt 
g-bgatgagag aatggagaca gagtccctgg ctcctctact 



^gcggcggca 


tgggtgcccc 


60 


cgcatctcta 


cattcaagaa 


120 


•atggccgagg 


ctggcttcat 


180 


■ttcttctgct 


tcaaggagct 


240 


aaaaagcatt 


cgtccggttg 


300 


cttggtgaat 


ttttgaaact 


360 


aacaataaga 


agaaagaatt 


420 


ctggctgcca 


tggattgagg 


480 


cttccagggt 


ttattccctg 


540 


ggaaaggaga 


tcaacatttt 


600 


agtggcacca 


gaggtgcttc 


660 


tctctctctc 


tctctttttt 


720 


gtgagaagtg 


agggaggaag 


780 


cgtgggcaga 


gccttccaca 


840 


tcctgagtgt 


ggacttggca 


900 


cacctgtgcc 


tcctcagagg 


960 


ggtagatgca 


tgacttgtgt 


1020 


gtttaacaac 


atggctttct 


1080 
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tafctttgttt gaattgttaa ttcacagaat agcacaaact a_caattaaaa ctaagcacaa 1140 

age cat t eta agtcattggg gaaacggggt gaacttcagg tggatgagga gacagaatag 12 00 

agtgatagga agegtctgge agatactcct tttgccactg ctgtgtgatt agacaggccc 12 60 

agtgagccgc ggggcacatg ctggccgctc ctccctcaga aaaaggcagt ggectaaate 132 0 

ctttttaaat gacttggctc gatgctgtgg gggactggct gjggctgctgc aggccgtgtg 13 80 

tctgtcagcc caaccttcac atctgtcacg ttctccacac gj-ggggagaga cgcagtccgc 144 0 

ccaggtcccc gctttctttg gaggcagcag ctcccgcagg grctgaagtct ggegtaagat 150 0 

gatg-gatttg attcgccctc ctccctgtca tagagctgea gj-ggtggattg ttacagcttc 156 0 

gctggaaacc tctggaggtc ateteggctg ttcctgagaa ataaaaagee tgtcatttc 1619 



<210> 64 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<3O0> 

<30 8> NM_001168 



<400> 64 

ttcacagaat agcacaaact acaattaaaa ctaagcacaa agecattcta agtcattggg 60 

<21_0> 65 

<211> 1552 

<212> DNA 

<2 1. 3> Homo sapiens 

<3O0> 

<308> NM_001216 



<4O0> 65 

gcccgtacac 

agcccctggc 

ctcjtcactgc 

tcccccttgg 

agtgaagagg 

gac/gatctac 

tccctgaagt 

aataatgece 

ccgccctggc 

cgcccccagc 

ctcccgccgc 

cctcctgggc 

ctcjcactggg 

cctgccgaga 

ggcjcgcccgg 

agtgcctatg 

caggtcccag 

tatgaggggt 

cagacagtga 

ggtgactctc 

gacjgcctcct 

aattcctgcc 

accagcgtcg 

gtcjagctacc 

tgtgagaagc 

atgccacttc 



accgtgtgct 
tccctctgtt 
tgcttctgat 
gaggaggctc 
attcacccag 
ctggagagga 
tagaggatct 
acagggacaa 
cccgggtgtc 
tcgccgcctt 
tcccagaact 
tagagatggc 
gggctgeagg 
tccacgtggt 
gaggectgge 
ageagttget 
gactggacat 
ctctgactac 
tgctgagtgc 
ggctacagct 
tccctgctgg 
tggctgctgg 
cgttccttgt 
gcccagcaga 
cagecagagg 
cttttaactg 



gggacacccc 
gatcccggcc 
gcctgtccat 
ttctggggaa 
agaggaggat 
ggatctacct 
acctactgtt 
agaaggggat 
cccagcctgc 
ctgcccggcc 
gcgcctgcgc 
tctgggtccc 
tcgtccgggc 
tcacctcagc 
cgtgttggcc 
gtctcgcttg 
atctgcactc 
accgccctgt 
taagcagctc 
gaacttccga 
agtggacagc 
tgacatccta 
gcagatgaga 
ggtagecgag 
catctgaggg 
ccaagaaatt 



acagtcagcc 
cctgctccag 
ccccagaggt 
gatgacccac 
ccacccggag 
gaagttaagc 
gaggctcctg 
gaccagagtc 
gcgggccgct 
ctgcgccccc 
aacaatggee 
gggcgggagt 
teggagcaca 
accgcctttg 
gectttctgg 
gaagaaatcg 
ctgccctctg 
geccagggtg 
cacaccctct 
gcgacgcagc 
agtcctcggg 
gccctggttt 
aggcagcaca 
actggagect 
ggagccggta 
ttttaaaata 



cjcatggctcc 
gjcctcactgt 
tgccccggat 
tgggcgagga 
atggaggatct 
ctaaatcaga 
gjagatcctca 
enttggcgcta 
tccagtcccc 
tggaactcct 
acagtgtgca 
accgggctct 
otgtggaagg 
ccagagttga 
etggagggccc 
ctgaggaagg 
acttcagccg 
tcatctggac 
ctgacaccct 
otttgaatgg 
ctgctgagcc 
ttggcctcct 
cjaaggggaac 
eigaggctgga 
eictgtcctgt 
aatatttata 



cctgtgcccc 
geaactgetg 
gcaggaggat 
ggatctgccc 
acctggagag 
agaagagggc 
agaaccccag 
tggaggegae 
ggtggatatc 
gggcttccag 
actgaccctg 
geagctgeat 
ccaccgtttc 
egaggecttg 
ggaagaaaac 
ctcagagact 
ctacttccaa 
tgtgtttaac 
gtggggacct 
gcgagtgatt 
agtccagctg 
ttttgctgtc 
caaagggggt 
tcttggagaa 
cctgctcatt 
at 1552 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 



<2IL0> 66 

<211> 60 

<212> DNA 

<2 3_3> Homo sapiens 
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<300> 

<308> NM_001216 
<400> 66 

tcctgtcctg ctcattatgc cacttccttt taactgccaa gaaatttttt aaaataaata 60 

<210> 67 

<211> 2653 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001254 
<400> 67 

gagcgcggct ggagtttgct gctgccgctg tgcagtttg-t tcaggggctt gtggtggtga 60 

gtccgagagg ctgcgtgtga gagacgtgag aaggatcctg cactgaggag gtggaaagaa 12 0 

gaggattgct cgaggaggcc tggggtctgt gaggcagcg-g agctgggtga aggctgcggg 180 

ttccggcgag gcctgagctg tgctgtcgtc atgcctcaaa cccgatccca ggcacaggct 240 

acaatcagtt ttccaaaaag gaagctgtct cgggcattga. acaaagctaa aaactccagt 3 00 

gatgccaaac tagaaccaac aaatgtccaa accgtaacct gttctcctcg tgtaaaagcc 3 60 

ctgcctctca gccccaggaa acgtctgggc gatgacaacc tatgcaacac tccccattta 42 0 

cctccttgtt ctccaccaaa gcaaggcaag aaagagaatg gtccccctca ctcacataca 480 

cttaagggac gaagattggt atttgacaat cagctgacaa ttaagtctcc tagcaaaaga 540 

gaactagcca aagttcacca aaacaaaata ctttcttcag ttagaaaaag tcaagagatc 600 

acaacaaatt ctgagcagag atgtccactg aagaaagaat ctgcatgtgt gagactattc 660 

aagcaagaag gcacttgcta ccagcaagca aagctggtcc tgaacacagc tgtcccagat 72 0 

cggctgcctg ccagggaaag ggagatggat gtcatcagga atttcttgag ggaacacatc 780 

tgtgggaaaa aagctggaag cctttacctt tctggtgctc ctggaactgg aaaaactgcc 840 

tgcttaagcc ggattctgca agacctcaag aaggaactga aaggctttaa aactatcatg 900 

ctgaattgca tgtccttgag gactgcccag gctgtattcc cagctattgc tcaggagatt 960 

tgtcaggaag aggtatccag gccagctggg aaggacatga tgaggaaatt ggaaaaacat 102 0 

atgactgcag agaagggccc catgattgtg ttggtatt^g acgagatgga tcaactggac 1080 

agcaaaggcc aggatgtatt gtacacgcta tttgaatgg-c catggctaag caattctcac 1140 

ttggtgctga ttggtattgc taataccctg gatctcacag atagaattct acctaggctt 12 00 

caagctagag aaaaatgtaa gccacagctg ttgaacttcc caccttatac cagaaatcag 1260 

atagtcacta ttttgcaaga tcgacttaat caggtatcta gagatcaggt tctggacaat 1320 

gctgcagttc aattctgtgc ccgcaaagtc tctgctgttt caggagatgt tcgcaaagca 13 80 

ctggatgttt gcaggagagc tattgaaatt gtagagtc<ag atgtcaaaag ccagactatt 1440 

ctcaaaccac tgtctgaatg taaatcacct tctgagcctc tgattcccaa gagggttggt 1500 

cttattcaca tatcccaagt catctcagaa gttgatggta acaggatgac cttgagccaa 1560 

gaaggagcac aagattcctt ccctcttcag cagaagatct tggtttgctc tttgatgctc 162 0 

ttgatcaggc agttgaaaat caaagaggtc actctgggQ-a agttatatga agcctacagt 1680 

aaagtctgtc gcaaacagca ggtggcggct gtggaccaQ-t cagagtgttt gtcactttca 1740 

gggctcttgg aagccagggg cattttagga ttaaagagaa acaaggaaac ccgtttgaca 1800 

aaggtgtttt tcaagattga agagaaagaa atagaacatg ctctgaaaga taaagcttta 1860 

attggaaata tcttagctac tggattgcct taaattcttc tcttacaccc cacccgaaag 192 0 

tattcagctg gcatttagag agctacagtc ttcattttag tgctttacac attcgggcct 1980 

gaaaacaaat atgacctttt ttacttgaag ccaatgaatt ttaatctata gattctttaa 2040 

tattagcaca gaataatatc tttgggtctt actattttta cccataaaag tgaccaggta 2100 

gacccttttt aattacattc actacttcta ccacttgt^t atctctagcc aatgtgcttg 2160 

caagtgtaca gatctgtgta gaggaatgtg tgtatattta cctcttcgtt tgctcaaaca 2220 

tgagtgggta tttttttgtt tgtttttttt gttgttgttg tttttgaggc gcgtctcacc 2280 

ctgttgccca ggctggagtg caatggcgcg ttctctgctc actacagcac ccgcttccca 2340 

ggttgaagtg attctcttgc ctcagcctcc cgagtagctg ggattacagg tgcccaccac 2400 

cgcgcccagc taatttttta atttttagta gagacagggt tttaccatgt tggccaggct 2460 

ggtcttgaac tcctgaccct caagtgatct gcccaccttg gcctccctaa gtgctgggat 252 0 

tataggcgtg agccaccatg ctcagccatt aaggtatttt gttaagaact ttaagtttag 2580 

ggtaagaaga atgaaaatga tccagaaaaa tgcaagcaag tccacatgga gatttggagg 2 640 
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acactggtta aag 2 653 

<210> 68 

<211> 60 

<212> DNA 

<2 13> Homo sapiens 

<300> 

<3 08> NM_001254 
<4L00> 68 

caaggaaacc cgtttgacaa aggtgttttt caagattgaa gagaaagaaa tagaacatgc 6 0 

<210> 69 

<211> 627 

<212> DNA 

<2 13> Homo sapiens 

<300> 

<3 08> NM_001323 
<400> 69 

gcggccgcaa gctcggcact cacggctctg agggctccga 
cgttcgaacc tcccgctggc gctgggcctg gccctggtcg- 
ccacgcgacg cccgggcccg gccgcaggag cgcatggtcg- 
cccgacgacc cgcaggtgca gaaggcggcg caggcggccp; 
agcaacagca tctactactt ccgagacacg cacatcatca 
gccggcatca agtacttcct gacgatggag atggggagca. 
gtcactggag accacgtcga cctcaccact tgccccctgg 
aagctgcgct gtgactttga ggtccttgtg gttccctggc 
aagcacaact gtgtgcagat gtgataagtc cccgagggco 
catggtggag ggcacttcag gtccgtgggc cgtatctgtc 
cttcttgcaa aaaaaaaaaa aaaaaaa 627 

<210> 70 
<211> 60 
<212> DNA 
<2 13> Homo sapiens 

<300> 

<308> NM_001323 
<400> 70 

atcaagtact tcctgacgat ggagatgggg 

<210> 71 
<211> 1812 
<212> DNA 
< 2 13> Homo sapiens 

<300> 

<308> NM_001428 
<400> 71 

tagctaggca ggaagtcggc gcgggcggcg 

gagatctcgc cggctttacg ttcacctcgg 

ggcgacgaga cccagtggct agaagttcac 

gatctttgac tctcgcggga atcccactgt 

cttcagagct gctgtgccca gtggtgcttc 

ggacaatgat aagactcgct atatggggaa 

taaaactatt gcgcctgccc tggttagcaa 



cggcactgac ggccatggcg 60 

cattctgcct cctggcgctg 12 0 

gagaactccg ggacctgtcg 180 

tggccagcta caacatgggc 240 

aggcgcagag ccagctggtg 3 00 

cagactgccg caagaccagg 3 60 

cagcaggggc gcagcaggag 42 0 

agaactcctc tcagctccta 480 

aaggccattg ggtttggggc 540 

acaataaatg gccagtgctg 600 



agcacagact gccgcaagac cagggtcact 60 



cggacagtat ctgtgggtac ccggagcacg 60 

tgtctgcagc accctccgct tcctctccta 12 0 

catgtctatt ctcaagatcc atgccaggga 180 

tgaggttgat: ctcttcacct caaaaggtct 240 

aactggtatc tatgaggccc tagagctccg 3 00 

gggtgtctcs aaggctgttg agcacatcaa 3 60 

gaaactgaac gtcacagaac aagagaagat 42 0 
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tgacaaactg 
cattctgggg 
gtaccgccac 
caatgtcatc 
cctcccagtc 
caacctgaag 
aggcgggttt 
tattgggaaa 
gttcttcagg 
catctcgcct 
gtctatcgaa 
tgcaggaatc 
ggccgtgaac 
gaccgagtct 
tcatcgttcg 
tgggcagatc 
cctcagaatt 
ccccttggcc 
cccctcccct 
ctgctagtta 
gccaagctcc 
tcattgtttt 
tctccggggt 
gtgacccatg 



atgatcgaga 
gtgtcccttg 
atcgctgact 
aatggcggtt 
ggtgcagcaa 
aatgtcatca 
gctcccaaca 
gctggctaca 
tctgggaagt 
gaccagctgg 
gatccctttg 
caggtagtgg 
gagaagtcct 
cttcaggcgt 
ggggagactg 
aagactggtg 
gaagaggagc 
aagtaagctg 
cgtgtcagct 
gcgccccacc 
ctggagccct 
tctcgcctca 
ggccacaggc 
ag 1812 



tggatggaac 
ccgtctgcaa 
tggctggcaa 
ctcatgctgg 
acttcaggga 
aggagaaata 
tcctggagaa 
ctgataaggt 
atgacctgga 
ctgacctgta 
accaggatga 
gggatgatct 
gcaactgcct 
gcaagctggc 
aagatacctt 
ccccttgccg 
tgggcagcaa 
tgggcaggca 
caggcagctc 
gccgtggagt 
gttggcagct 
ctttccacca 
tagatccccg 



agaaaataaa 
agctggtgcc: 
ctctgaagtc 
caacaagctg 
agccatgcgc 
tgggaaagafc 
taaagaaggc 
ggtcatcggc 
cttcaagtct 
caagtccttc 
ctggggag-cfc 
cacagtgacc 
cctgctcaaa 
ccaggccaat 
catcgctgac 
atctgagcgc 
ggctaagttt 
agcccttcgg" 
gaggcccccg 
tcgtaccgct 
ctagctttgc 
agtgtctaga 
gtggttttgt 



tctaagtttg 
gttgagaagg 
atcctgccag 
gccatgcagg 
attggagcag 
gccaccaatg 
ctggagctgc 
atggacgtag 
cccgatgacc 
atcaaggact 
tggcagaagt 
aacccaaaga 
gtcaaccaga 
ggttggggcg 
ctggttgtgg 
ttggccaagt 
gccggcagga 
tcacctgttg 
accaacactt 
tccttagaac 
agtcgtgtaa 
gtcatgtgag 
gctcaaaata 



gtgcgaacgc 
gggtccccct 
tcccggcgtt 
agttcatgat 
aggtttacca 
tgggggatga 
tgaagactgc 
cggcctccga 
ccagcaggta 
acccagtggt 
tcacagccag 
ggatcgccaa 
ttggctccgt 
tcatggtgtc 
ggctgtgcac 
acaaccagct 
acttcagaaa 
gctacacaga 
gcaggggtcc 
ttctacagaa 
ttggcccaag 
cctcgtgtca 
aaaagcctca 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 



<210> 72 
<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001428 



<400> 72 

agctctagct tttgcagtcg tgtaatgggc ccaagtcatt gtttttctcg cctcactttc 



60 



<210> 73 

<211> 8368 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001456 



<400> 73 

gcgatccggg 

acctctgctc 

cgtcccctcg 

ctcccactct 

gcgggacgcc 

gatccagcag 

catcgccaac 

gctcagccag 
gcttgagaac 
catcgacagc 
cctgatcctg 
caagaagcag 
gcccatcacc 
cagctgtgcc 



cgccaccccg 
cctgcctcgc 
ccgaggctgc 
cgggcgggcc 
gagatgccgg 
aacactttca 
ctgcagacgg 

aagaagatgc 
gtgtcggtgg 
aaggccatcg 
cactactcca 
acccccaagc 
aacttcagcc 
ccgggcctgt 



cggtcatcgg 
ctcccgcgcg 
agcaccgccc 
agagcgcagc 
ccaccgagaa 
cgcgctggtg 
acctgagcga 

accgcaagca 
cgctcgagtt 
tggacgggaa 
tctccatgcc 
agaggctcct 
gggactggca 
gtcctgactg 



tcaccggtcg 
cctaggtgcc 
ccccggcttc 
aggcgcggc-t 
ggacctggc^ 
caacgagcac 
cgggctgcg^ 

caaccagcgg 
cctggaccgc 
cctgaagctg 
catgtgggac 
gggctggato 
gagcggccgg- 
ggactcttg^ 



ctctcaggaa 
tgcgacttta 
tcgcgcctca 
ccgggcggcg 
gaggacgcgc 
ctgaagtgcg 
cttatcgcgc 

cccactttcc 
gagagcatca 
atcctgggcc 
gaggaggagg 
cagaacaagc 
gccctgggcg 
gacgccagca 



cagcagcgca 
attaaagggc 
aaatgagtag 
gcgtcgacac 
cgtggaagaa 
tgagcaagcg 
tgttggaggt 

gccaaatgca 
aactggtgtc 
tcatctggac 
atgaggaggc 
tgccgcagct 
ccctggtgga 
agcccgttac 



60 

120 

180 

240 

300 

360 

420 

480 
540 
600 
660 
720 
780 
840 



34 



WO 2005/086891 



PCT/US2005/007894 



caatgcgcga gaggccatgc agcaggcgga tgactg-gctg ggcatccccc aggtgatcac 9 00 

ccccgaggag attgtggacc ccaacgtgga cgagcactct gtcatgacct acctgtccca 9 60 

gttccccaag gccaagctga agccaggggc tcccttgcgc cccaaactga acccgaagaa 102 0 

agcccgtgcc tacgggccag gcatcgagcc cacaggcaac atggtgaaga agcgggcaga 1080 

gttcactgtg gagaccagaa gtgctggcca gggageiggtg ctggtgtacg tggaggaccc 1140 

ggccggacac caggaggagg caaaagtgac cgccaataac gacaagaacc gcaccttctc 1200 

cgtctggtac gtccccgagg tgacggggac tcataaggtt actgtgctct ttgctggcca 12 60 

gcacatcgcc aagagcccct tcgaggtgta cgtgg-aitaag tcacagggtg acgccagcaa 1320 

agtgacagcc caaggtcccg gcctggagcc cagtggcaac atcgccaaca agaccaccta 13 80 

ctttgagatc tttacggcag gagctggcac gggcg*siggtc gaggttgtga tccaggaccc 1440 

catgggacag aagggcacgg tagagcctca gctgg-<aggcc cggggcgaca gcacataccg 1500 

ctgcagctac cagcccacca tggagggcgt ccacaccgtg cacgtcacgt ttgccggcgt 1560 

gcccatccct cgcagcccct acactgtcac tgttg-^ccaa gcctgtaacc cgagtgcctg 1620 

ccgggcggtt ggccggggcc tccagcccaa gggtg-tgcgg gtgaaggaga cagctgactt 1680 

caaggtgtac acaaagggcg ctggcagtgg ggagctgaag gtcaccgtga agggccccaa 1740 

gggagaggag cgcgtgaagc agaaggacct ggggg*«a.tggc gtgtatggct tcgagtatta 1800 

ccccatggtc cctggaacct atatcgtcac catcacgtgg ggtggtcaga acatcgggcg 1860 

cagtcccttc gaagtgaagg tgggcaccga gtgtg-^caat cagaaggtac gggcctgggg 192 0 

ccctgggctg gagggcggcg tcgttggcaa gtcagcagac tttgtggtgg aggctatcgg 1980 

ggacgacgtg ggcacgctgg gcttctcggt ggaagggcca tcgcaggcta agatcgaatg 2 040 

tgacgacaag ggcgacggct cctgtgatgt gcgctactgg ccgcaggagg ctggcgagta 2100 

tgccgttcac gtgctgtgca acagcgaaga catccgcctc agccccttca tggctgacat 2160 

ccgtgacgcg ccccaggact tccacccaga cagggtgaag gcacgtgggc ctggattgga 2220 

gaagacaggt gtggccgtca acaagccagc agagttcaca gtggatgcca agcacggtgg 2280 

caaggcccca cttcgggt cc aagtccagga caatgaaggc tgccctgtgg aggcgttggt 2340 

caaggacaac ggcaatggca cttacagctg ctcctacgtg cccaggaagc cggtgaagca 2400 

cacagccatg gtgtcctggg gaggcgtcag catccccaac agccccttca gggtgaatgt 2460 

gggagctggc agccacccca acaaggtcaa agtatacggc cccggagtag ccaagacagg 2 52 0 

gctcaaggcc cacgagccca cctacttcac tgtgg-a.ctgc gccgaggctg gccaggggga 2 5 80 

cgtcagcatc ggcatcaagt gtgcccctgg agtgg-taggc cccgccgaag ctgacatcga 2 640 

cttcgacatc atccgcaatg acaatgacac cttcacggtc aagtacacgc cccggggggc 2700 

tggcagctac accattatgg tcctctttgc tgaccaggcc acgcccacca gccccatccg 2760 

agtcaaggtg gagccctctc atgacgccag taaggj-tgaag gccgagggcc ctggcctcag 2 82 0 

tcgcactggt gtcgagcttg gcaagcccac ccacttcaca gtaaatgcca aagctgctgg 2880 

caaaggcaag ctggacgtcc agttctcagg actcaccaag ggggatgcag tgcgagatgt 2940 

ggacatcatc gaccaccatg acaacaccta cacagtcaag tacacgcctg tccagcaggg 3 000 

tccagtaggc gtcaatgtca cttatggagg ggatcccatc cctaagagcc ctttctcagt 3 060 

ggcagtatct ccaagcctgg acctcagcaa gatca_aggtg tctggcctgg gagagaaggt 3120 

ggacgttggc aaagaccagg agttcacagt caaatcaaag ggtgctggtg gtcaaggcaa 3180 

agtggcatcc aagattgtgg gcccctcggg tgcacjcggtg ccctgcaagg tggagccagg 3240 

cctgggggct gacaacagtg tggtgcgctt cctgccccgt gaggaagggc cctatgaggt 3300 

ggaggtgacc tatgacggcg tgcccgtgcc tggcagcccc tttcctctgg aagctgtggc 33 60 

ccccaccaag cctagcaagg tgaaggcgtt tgggccgggg ctgcagggag gcagtgcggg 3420 

ctcccccgcc cgcttcacca tcgacaccaa gggccjccggc acaggtggcc tgggcctgac 3480 

ggtggagggc ccctgtgagg cgcagctcga gtgcttggac aatggggatg gcacatgttc 3540 

cgtgtcctac gtgcccaccg agcccgggga ctacaacatc aacatcctct tcgctgacac 3 600 

ccacatccct ggctccccat tcaaggccca cgtgcjttccc tgctttgacg catccaaagt 3660 

caagtgctca ggccccgggc tggagcgggc cacccjctggg gaggtgggcc aattccaagt 372 0 

ggactgctcg agcgcgggca gcgcggagct gaccsLttgag atctgctcgg aggcggggct 37 80 

tccggccgag gtgtacatcc aggaccacgg tgatcjgcacg cacaccatta cctacattcc 3 840 

cctctgcccc ggggcctaca ccgtcaccat caagtacggc ggccagcccg tgcccaactt 3900 

ccccagcaag ctgcaggtgg aacctgcggt ggacacttcc ggtgtccagt gctatgggcc 3960 

tggtattgag ggccagggtg tcttccgtga ggccaccact gagttcagtg tggacgcccg 402 0 

ggctctgaca cagaccggag ggccgcacgt caagcjcccgt gtggccaacc cctcaggcaa 4080 

cctgacggag acctacgttc aggaccgtgg cgatcjgcatg tacaaagtgg agtacacgcc 4140 

ttacgaggag ggactgcact ccgtggacgt gacctzatgac ggcagtcccg tgcccagcag 42 00 

ccccttccag gtgcccgtga ccgagggctg cgacccctcc cgggtgcgtg tccacgggcc 42 60 

aggcatccaa agtggcacca ccaacaagcc caacaagttc actgtggaga ccaggggagc 4320 

tggcacgggc ggcctgggcc tggctgtaga gggcccctcc gaggccaaga tgtcctgcat 43 80 

ggataacaag gacggcagct gctcggtcga gtacatccct tatgaggctg gcacctacag 4440 

cctcaacgtc acctatggtg gccatcaagt gccacjgcagt cctttcaagg tccctgtgca 45 00 
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tgatgtgaca gatgcgtcca aggtcaagtg ctctgggccc ggcctgagcc caggcatggt 4560 

tcgtgccaac ctccctcagt ccttccaggt ggacacaagc aaggctggtg tggccccatt 462 0 

gcaggtcaaa gtgcaagggc ccaaaggcct ggtggag-cca gtggacgtgg tagacaacgc 4680 

tgatggcacc cagaccgtca attatgtgcc cagccgagaa gggccctaca gcatctcagt 4740 

a_ctgtatgga gatgaagagg taccccggag ccccttcaag gtcaaggtgc tgcctactca 4800 

tgatgccagc aaggtgaagg ccagtggccc cgggctcaac accactggcg tgcctgccag 4860 

cctgcccgtg gagttcacca tcgatgcaaa ggacgccggg gagggcctgc tggctgtcca 492 0 

gatcacggat cccgaaggca agccgaagaa gacacacatc caagacaacc atgacggcac 4980 

g-tatacagtg gcctacgtgc cagacgtgac aggtcgcrttac accatcctca tcaagtacgg 5 040 

tg-gtgacgag atccccttct ccccgtaccg cgtgcgt:gcc gtgcccaccg gggacgccag 5100 

caagtgcact gtcacagtgt caatcggagg tcacggg-cta ggtgctggca tcggccccac 5160 

cattcagatt ggggaggaga cggtgatcac tgtggacact aaggcggcag gcaaaggcaa 522 0 

a-gtgacgtgc accgtgtgca cgcctgatgg ctcagag-gtg gatgtggacg tggtggagaa 52 80 

tgaggacggc actttcgaca tcttctacac ggccccccag ccgggcaaat acgtcatctg 5340 

tgtgcgcttt ggtggcgagc acgtgcccaa cagccccttc caagtgacgg ctctggctgg 5400 

ggaccagccc tcggtgcagc cccctctacg gtctcagcag ctggccccac agtacaccta 5460 

cgcccagggc ggccagcaga cttgggcccc ggagaggccc ctggtgggtg tcaatgggct 5520 

g-gatgtgacc agcctgaggc cctttgacct tgtcatcccc ttcaccatca agaagggcga 5580 

gatcacaggg gaggttcgga tgccctcagg caaggtg-gcg cagcccacca tcactgacaa 5640 

caaagacggc accgtgaccg tgcggtatgc acccagcgag gctggcctgc acgagatgga 57 00 

catccgctat gacaacatgc acatcccagg aagccccttg cagttctatg tggattacgt 5760 

caactgtggc catgtcactg cctatgggcc tggcctcacc catggagtag tgaacaagcc 5 82 0 

tgccaccttc accgtcaaca ccaaggatgc aggaga^ggg ggcctgtctc tggccattga 5880 

gggcccgtcc aaagcagaaa tcagctgcac tgacaaccag gatgggacat gcagcgtgtc 5940 

ctacctgcct gtgctgccgg gggactacag cattctagtc aagtacaatg aacagcacgt 6 000 

cccaggcagc cccttcactg ctcgggtcac aggtgacgac tccatgcgta tgtcccacct 6060 

aaaggtcggc tctgctgccg acatccccat caacatctca gagacggatc tcagcctgct 6120 

gacggccact gtggtcccgc cctcgggccg ggaggag-ccc tgtttgctga agcggctgcg 6180 

taatggccac gtggggattt cattcgtgcc caaggagacg ggggagcacc tggtgcatgt 6240 

gaagaaaaat ggccagcacg tggccagcag ccccatcccg gtggtgatca gccagtcgga 63 00 

aattggggat gccagtcgtg ttcgggtctc tggtcag-ggc cttcacgaag gccacacctt 63 60 

tgagcctgca gagtttatca ttgatacccg cgatgcaggc tatggtgggc tcagcctgtc 6420 

cattgagggc cccagcaagg tggacatcaa cacagaggac ctggaggacg ggacgtgcag 6480 

ggtcacctac tgccccacag agccaggcaa ctacatcatc aacatcaagt ttgccgacca 6540 

gcacgtgcct ggcagcccct tctctgtgaa ggtgacaggc gagggccggg tgaaagagag 6 600 

catcacccgc aggcgtcggg ctccttcagt ggccaacgtt ggtagtcatt gtgacctcag 6660 

cctgaaaatc cctgaaatta gcatccagga tatgacagcc caggtgacca gcccatcggg 6720 

caagacccat gaggccgaga tcgtggaagg ggagaaccac acctactgca tccgctttgt 6780 

tcccgctgag atgggcacac acacagtcag cgtcaagtac aagggccagc acgtgcctgg 6 840 

gagccccttc cagttcaccg tggggcccct aggggasggg ggagcccaca aggtccgagc 6900 

fcgggggccct ggcctggaga gagctgaagc tggagtcjcca gccgaattca gtatctggac 6960 

ccgggaagct ggtgctggag gcctggccat tgctgtogag ggccccagca aggctgagat 7 020 

otcttttgag gaccgcaagg acggctcctg tggtgt^gct tatgtggtcc aggagccagg 7 080 

tgactacgaa gtctcagtca agttcaacga ggaacacatt cccgacagcc ccttcgtggt 7140 

Q-cctgtggct tctccgtctg gcgacgcccg ccgcctcact gtttctagcc ttcaggagtc 72 00 

agggctaaag gtcaaccagc cagcctcttt tgcagtcagc ctgaacgggg ccaagggggc 72 60 

gatcgatgcc aaggtgcaca gcccctcagg agccctggag gagtgctatg tcacagaaat 732 0 

tgaccaagat aagtatgctg tgcgcttcat ccctcg^gag aatggcgttt acctgattga 73 80 

cgtcaagttc aacggtaccc acatccctgg aagccccttc aagatccgag ttggggagcc 7440 

fcgggcatgga ggggacccag gcttggtgtc tgcttacgga gcaggtctgg aaggcggtgt 7 500 

cacagggaac ccagctgagt tcgtcgtgaa cacgagcaat gcgggagctg gtgccctgtc 7 560 

g-gtgaccatt gacggcccct ccaaggtgaa gatggattgc caggagtgcc ctgagggcta 7 62 0 

ccgcgtcacc tataccccca tggcacctgg cagctacctc atctccatca agtacggcgg 7680 

cccctaccac attgggggca gccccttcaa ggccaaagtc acaggccccc gtctcgtcag 7740 

caaccacagc ctccacgaga catcatcagt gtttgt«agac tctctgacca aggccacctg 7 800 

fcgccccccag catggggccc cgggtcctgg gcctgctgac gccagcaagg tggtggccaa 7 860 

g-ggcctgggg ctgagcaagg cctacgtagg ccagaagagc agcttcacag tagactgcag 7920 

caaagcaggc aacaacatgc tgctggtggg ggttca-fcggc ccaaggaccc cctgcgagga 7980 

gatcctggtg aagcacgtgg gcagccggct ctacagcgtg tcctacctgc tcaaggacaa 8040 

g-ggggagtac acactggtgg tcaaatgggg gcacgagcac atcccaggca gcccctaccg 8100 

ogttgtggtg ccctgagtct ggggcccgtg ccagccggca gcccccaagc ctgccccgct 8160 
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acccaagcag ccccgccctc ttcccctcaa ccccggccca ggccgccctg gccgcccgcc 822 0 

tgtcactgca gctgcccctg ccctgtgccg tgctgcgctc acctgcctcc ccagccagcc 8280 

gctgacctct cggctttcac ttgggcagag ggagccattt ggtggcgctg cttgtcttct 8340 

ttggttctgg gaggggtgag ggatgggg 83 68 

<210> 74 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001456 
<400> 74 

tgacctctcg gctttcactt gggcagaggg agccatttgg tggcgctgct tgtcttcttt 60 

<210> 75 

<211> 1642 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001548 
<400> 75 

ccagatctca gaggagcctg gctaagcaaa accctgcaga acggctgcct aatttacagc 60 

aaccatgagt acaaatggtg atgatcatca ggtcaetggat agtctggagc aattgagatg 120 

tcactttaca tgggagttat ccattgatga cgatgataatg cctgatttag aaaacagagt 180 

cttggatcag attgaattcc tagacaccaa atacacjtgtg ggaatacaca acctactagc 240 

ctatgtgaaa cacctgaaag gccagaatga ggaagccctg aagagcttaa aagaagctga 3 00 

aaacttaatg caggaagaac atgacaacca agcaaatgtg aggagtctgg tgacctgggg 3 60 

caactttgcc tggatgtatt accacatggg cagactggca gaagcccaga cttacctgga 420 

caaggtggag aacatttgca agaagctttc aaatcccttc cgctatagaa tggagtgtcc 480 

agaaatagac tgtgaggaag gatgggcctt gctgaagtgt ggaggaaaga attatgaacg 540 

ggccaaggcc tgctttgaaa aggtgcttga agtggaccct gaaaaccctg aatccagcgc 600 

tgggtatgcg atctctgcct atcgcctgga tggctttaaa ttagccacaa aaaatcacaa 660 

gccattttct ttgcttcccc taaggcaggc tgtccgjctta aatccagaca atggatatat 72 0 

taaggttctc cttgccctga agcttcagga tgaaggacag gaagctgaag gagaaaagta 780 

cattgaagaa gctctagcca acatgtcctc acagacctat gtctttcgat atgcagccaa 840 

gttttaccga agaaaaggct ctgtggataa agctcttgag ttattaaaaa aggccttgca 900 

ggaaacaccc acttctgtct tactgcatca ccagataggg ctttgctaca aggcacaaat 960 

gatccaaatc aaggaggcta caaaagggca gcctag-aggg cagaacagag aaaagctaga 102 0 

caaaatgata agatcagcca tatttcattt tgaatctgca gtggaaaaaa agcccacatt 1080 

tgaggtggct catctagacc tggcaagaat gtatatagaa gcaggcaatc acagaaaagc 1140 

tgaagagaat tttcaaaaat tgttatgcat gaaaccagtg gtagaagaaa caatgcaaga 1200 

catacatttc tactatggtc ggtttcagga atttcaaaag aaatctgacg tcaatgcaat 1260 

tatccattat ttaaaagcta taaaaataga acaggcatca ttaacaaggg ataaaagtat 132 0 

caattctttg aagaaattgg ttttaaggaa acttcggaga aaggcattag atctggaaag 13 80 

cttgagcctc cttgggttcg tctataaatt ggaaggaaat atgaatgaag ccctggagta 1440 

ctatgagcgg gccctgagac tggctgctga ctttgagaac tctgtgagac aaggtcctta 1500 

ggcacccaga tatcagccac tttcacattt catttcattt tatgctaaca tttactaatc 1560 

atcttttctg cttactgttt tcagaaacat tataattcac tgtaatgatg taattcttga 1620 
ataataaatc tgacaaaata tt 1642 

<210> 76 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 
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<308> NM„001548 
<400> 76 

gtatcaattc tttgaagaaa ttggttttaa ggaaacttcg gagaaaggca ttagatctgg 60 

<210> 77 

<211> 3344 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_0 01605 
<400> 77 

ggtacagctg cgcgtctgcg ggaataggtg cagcgggccc ttggcggggg actctgaggg 60 

aggagctggg gacggcgacc ctaggagagt tctttg-gggt gactttcaag atggactcta 12 0 

ctctaacagc aagtgaaatc cggcagcgat ttatag-attt cttcaagagg aacgagcata 180 

cgtatgttca ctcgtctgcc accatcccat tggatgaccc cactttgctc tttgccaatg 240 

caggcatgaa ccagtttaaa cccattttcc tgaacacaat tgacccatct caccccatgg 300 

caaagctgag cagagctgcc aatacccaga agtgcatccg ggctgggggc aaacaaaatg 3 60 

acctggacga tgtgggcaag gatgtctatc atcacacctt cttcgagatg ctgggctctt 42 0 

ggtcttttgg agattacttt aaggaattgg catgtaagat ggctctggaa ctcctcaccc 480 

aagagtttgg cattcccatt gaaagacttt atgttca.ctta ctttggcggg gatgaagcag 540 

ctggcttaga agcagatctg gaatgcaaac agatctggca aaatttgggg ctggatgaca 600 

ccaaaatcct cccaggcaac atgaaggata acttcfcggga gatgggtgac acgggcccct 660 

gtggtccttg cagtgagatc cactacgacc ggatt^gtgg tcgggacgcc gcacatcttg 720 

tcaaccagga cgaccctaat gtgctggaga tctggaacct tgtgttcatc cagtataaca 7 80 

gggaagctga tggcattctg aaacctcttc ccaagaaaag cattgacaca gggatgggcc 840 

tggaacgact ggtatctgtg ctgcagaata agatgtccaa ctatgacact gacctttttg 900 

tcccttactt tgaagccatt cagaagggca caggtgcccg accatacact gggaaagttg 960 

gtgctgagga tgccgatggg attgacatgg cctaccgggt gctggctgac catgctcgga 102 0 

ccatcactgt ggcactggct gatggtggcc ggcctgacaa cacagggcgt ggatatgtgt 1080 

tgagacggat tctccgccga gctgtccgat acgcccatga aaagctcaat gccagcaggg 1140 

gcttctttgc tacgttagtg gatgttgtcg tccagtccct gggagatgca tttcctgagc 12 00 

tgaagaagga cccagacatg gtgaaggaca tcattaatga agaagaggtg cagtttctca 12 60 

agactctcag cagagggcgt cgcatcctgg acaggaaaat tcagagcctg ggagacagca 132 0 

agaccattcc cggagacact gcttggctcc tctatgacac ctatgggttt ccagtggatc 13 80 

tgactggact gattgctgaa gagaagggcc tggtgg-taga catggatggc tttgaagagg 1440 

agaggaaact ggcccagctg aaatcacagg gcaagQgagc tggtggggaa gacctcatta 150 0 

tgctggacat ttacgctatc gaagagctcc gggcacgggg tctggaggtc acagatgatt 156 0 

ccccaaagta caattaccat ttggactcca gtggteigcta tgtatttgag aacacagtgg 162 0 

ctacggtgat ggctctgcgc agggagaaga tgttcQ-tgga agaggtgtcc acaggccagg 1680 

agtgtggagt ggtgctggac aagacctgtt tctat^ctga gcaaggaggc cagatctatg 1740 

acgaaggcta cctggtgaag gtggatgaca gcagtgaaga taaaacagag tttacagtga 1800 

agaatgctca ggtccgagga gggtatgtgc tacacattgg aaccatctac ggtgacctga 1860 

aagtggggga tcaggtctgg ctgtttattg atgagccccg acgaagaccc atcatgagca 192 0 

accacacagc tacgcacatt ctgaacttcg ccctgcgctc agtgcttggg gaagctgacc 1980 

agaaaggctc attggttgct cctgaccgcc tcagatttga ctttactgcc aagggagcca 2 040 

tgtccaccca acagatcaag aaggctgaag agattgctaa tgagatgatt gaggcagcca 2100 

aggccgtcta tacccaggat tgccccctgg cagcagcgaa agccatccag ggcctacggg 2160 

ctgtgtttga tgagacctat cctgaccctg tgcgag-tcgt ctccattggg gtcccggtgt 222 0 

ccgagttgct ggatgacccc tctgggcctg ctggctccct gacttctgtt gagttctgtg 22 80 

ggggaacgca cctgcggaac tcgagtcatg cagga^cttt tgtgatcgtg acggaagaag 2340 

ccattgccaa gggtatccgg aggattgtgg ctgtcacagg tgccgaggcc cagaaggccc 2400 

tcaggaaagc agagagcttg aagaaatgtc tctct^tcat ggaagccaaa gtgaaggctc 2460 

agactgctcc aaacaaggat gtgcagaggg agatcgctga ccttggagag gccctggcca 2 52 0 

ctgcagtcat cccccagtgg cagaaggatg aattgcggga gactctcaaa tccctaaaga 2580 

aggtcatgga tgacttggac cgagccagca aagccgatgt ccagaaacga gtgttagaga 2 640 

agacgaagca gttcatcgac agcaacccca accagcctct tgtcatcctg gagatggaga 2700 

gcggcgcctc agccaaggcc ctgaatgaag ccttgaagct cttcaagatg cactcccctc 2760 
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agacttctgc catgctcttc acggtggaca atgag^ctgg caagatcacg tgcctgtgtc 2 82 0 

aagtccccca gaatgcagcc aatcggggct taaaagccag cgagtgggtg cagcaggtgt 2 880 

caggcttgat ggacggtaaa ggtggtggca aggat^tgtc tgcacaggcc acaggcaaga 2 940 

acgttggctg cctgcaggag gcgctgcagc tggccacttc cttcgcccag ctgcgcctcg 3 00 0 

gggatgtaaa gaactgagtg gggaaggagg aggctcccac tggatccatc cgtccagcca 3 06 0 

agagctcttc atctgctaca agaacatttg aatcttggga cctttaaaga gcccctccta 312 0 

acccagcagt aactggaaca cacttgggag cagtcctatg tctcagtgcc ccttaaattt 3180 

ctgccctgag ccctccacgt cagtgccatc ggtctagaac cactaacccc gcattgctgt 3240 

tgatcgtcac gctcgcatct atagataacg gctctccaga cctgagcttt ccgcgtcagc 3 30 0 
aagtaggaat cgtttttgct gcagagaata aaaggaccac gtgc 3344 



<210> 78 
<211> 60 
<212> DNA 
<213> Homo 



sapiens 



<300> 

<308> NM_001605 



<400> 78 

gccaagagct cttcatctgc tacaagaaca tttgaatctt gggaccttta aagagcccct 6 0 



<210> 79 

<211> 417 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001645 



<400> 79 

acctcccaac caagccctcc agcaaggatt caggagtgcc cctcgggcct cgccatgagg 60 

ctcttcctgt cgctcccggt cctggtggtg gttctgtcga tcgtcttgga aggcccagcc 12 0 

ccagcccagg ggaccccaga cgtctccagt gccttggata agctgaagga gtttggaaac 180 

acactggagg acaaggctcg ggaactcatc age eg- cat ca aacagagtga actttctgcc 2 40 

aagatgcggg agtggttttc agagacattt cagaaagtga aggagaaact caagattgac 3 00 

tcatgaggac ctgaagggtg acatccagga ggggcctctg aaatttccca caccccagcg 3 60 
cctgtgctga ggactcccgc catgtggccc caggtgccac caataaaaat cctaccg 417 



<210> 80 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_001645 



<400> 80 

aaacagagtg aactttctgc caagatgegg gagtcjgtttt cagagacatt tcagaaagtg 60 

<210> 81 

<211> 1389 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM„001809 
<400> 81 

cgcggacttc tgccaagcac eggctcatgt gaggctcgeg geacagegtt ctctgggctc 60 
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cccagaagcc 
cgccagcacc 
ccgaggaggc 
ttaggcgctt 
aagcttcaga 
gaaatatgtg 
gccctacaag 
accttacatg 
cggggccttg 
tgctcagcca 
tctgcgatgc 
ttttcttttt 
tgagacaatc 
gttgacttta 
tgctgtggta 
cattttttac 
actttgagca 
gtagcctcac 
tgtagctttt 
gttacttcta 
attttactct 
ttcgatgctt 
atttaatga 



agcctttcgc 
ctctgcggcg 
gcagcccgag 
cctcccatca 
agagcacaca 
ttaaattcac 
aggcagcaga 
caggccgagt 
aggagggact 

ggggggatga 

tgtctggact 
tttgagaagg 
aacaccgttc 
gtttgtgagt 
tgggagaaaa 
catatgtaca 
gttgcctgga 
cattagtggc 
cagaaactta 
gtaaattcct 
tgtaaatcat 
ttatgggtat 
1389 



tcccggaccc 
tgtcatgggc 
cccgaccccg 
acacagtcgg 
cctcttgata 
tcgtggtgtg 
agcatttcta 
tactctcttc 
cggctgagct 
taccggggac 
ttgctgtctc 
agaagactgc 
caaaggcctg 
tactcatgtg 
ggttatgtac 
tttgtacttt 
aggctgggca 
agcatcatgt 
attggggatg 
gtcaaaatca 
agaagatgta 
ttttgtagtt 



ggcagcccga 
ccgcgccgcc 
acccccggcc 
cggetgacaag 
aggatagctgc 
gacttcaatt 
gttcatctct 
ccaaaggatg 
cctgcaccca 
tctccagagc 
tgaacagtat 
atgactttcc 
aaaataattt 
actatttgag 
ttattatttt 
tattttacac 
tttczcatcat 
aactgagtgg 
aatagaaaac 
attcagaaat 
tcataacagt 
tctttgtaga 



gcaggagccg 
ggagccgaaa 
cctcccggcg 
gttggctaaa 
ccttcagccg 
ggcaagccca 
ttgaggacgc 
tgcaactggc 
gtgtttctgt 
catgactaga 
gtgtgtgttg 
tctgtaacag 
tcagataaag 
gattttgaaa 
agctctttct 
ataagggaaa 
atagacctct 
actgtgcttg 
ctgtaagctt 
tctaacttgg 
tcagaatttt 
gagataataa 



tgggaccggg 
gcccgaggcc 
gggcccctcc 
ggagatccga 
cctggcaaga 
ggccctattg 
ctatctcctc 
ccggaggatc 
cagtctttcc 
tccaatggat 
ctttaaatat 
aggtaatata 
agactccaag 
acatcagatt 
gtaatattta 
aaataagacc 
gcccttcaga 
tcaacggatg 
tgatgttctg 
agaatttaac 
aaagtacatt 
aaatcaaaat 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 



<210> 82 
<211> 60 
<212> DNA 



<213> Homo sapiens 



<300> 

<308> NM_001809 



<400> 82 

ggggatgaat agaaaacctg taagctttga tgttctggtt acttctagta aattcctgtc 60 



<210> 83 

<211> 2205 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJD01909 



<400> 83 

gcgcacgccg gccgcgccca cgtgaccggt 
ggcccaactg cggcgtcatc ccggctataa 
ggccgccgcc gccatgcagc cctccagcct 
acccgcctcc gcgctcgtca ggatcccgct 
gtcggaggtt gggggctctg tggaggacct 
ccaggcggtg ccagccgtga ccgaggggcc 
cgcccagtac tacggggaga ttggcatcgg 
cgacacgggc tcctccaacc tgtgggtccc 
ttgctggatc caccacaagt acaacagcga 
ctcgtttgac atccactatg gctcgggcag 
gtcggtgccc tgccagtcag cgtcgtcagc 
gcaggtcttt ggggaggcca ccaagcagcc 
tggcatcctg ggcatggcct acccccgcat 
caacctgatg cagcagaagc tggtggacca 
cccagatgcg cagcctgggg gtgagctgat 



ccgggtgcaa acacgcgggt cagctgatcc 60 

gcgcacggcc tcggcgaccc tctccgaccc 12 0 

tctgccgctc gccctctgcc tgctggctgc 180 

gcacaagttc acgtccatcc gccggaccat 2 40 

gattgccaaa ggccccgtct caaagtactc 3 00 

cattcccgag gtgctcaaga actacatgga 3 60 

gacgcccccc cagtgcttca cagtcgtctt 42 0 

ctccatccac tgcaaactgc tggacatcgc 480 

caagtccagc acctacgtga agaatggtac 540 

cctctccggg tacctgagcc aggacactgt 600 

ctctgccctg ggcggtgtca aagtggagag 660 

aggcatcacc ttcatcgcag ccaagttcga 72 0 

ctccgtcaac aacgtgctgc ccgtcttcga 780 

gaacatcttc tccttctacc tgagcaggga 840 

gct^ggtggc acagactcca agtattacaa 9 00 
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gggttctctg 
ggtggaggtg 
aggcacttcc 
ggccgtgccg 
cgcgatcaca 
caaggtgtcg 
gccacccagc 
gtttgaccgt 
gcgtccgcgc 
cggcccctcc 
cggcggccca 
atgctgcctg 
agatctgttt 
ggggctccct 

gcccctaccc 
gggcccctct 
gggcccgttg 
acaaaaccca 
atgtattggc 
gacctctgtt 
ttggcctccg 
aaaaaaaaaa 



tcctacctga 
gccagcgggc 
ctcatggtgg 
ctgattcagg 
ctgaagctgg 
caggccggga 
gggccactct 
gacaacaaca 
gccagcacag 
cacacacacc 
agcccgactt 
cctgtctgtc 
cgtgcattgg 
tcatctccag 

cacaccaggc 
gcccagcttg 
aggatgaggc 
ccttgttgga 
ctggaggtgg 
gtcctcccct 
gcctcccctc 
aaaaaaaaaa 



atgtcacccg 
tgaccctgtg 
gcccggtgga 
gcgagtacat 
gaggcaaagg 
agaccctctg 
ggatcctggg 
gggtgggctt 
aaacagagga 
cacacactcg 
gctgttttgt 
tctccatctg 
aagaccccac 
ggagtcccct 

cgtcctcccg 
ggcccagctg 
cgctagaggc 
gcctgcaggg 
ggttgggatt 
tgggcggctg 
aaaaaaaaaa 
aaaaaaaaaa 



caaggcctac 
caaggagggc 
t^aggtgcgc 
gatcccctgt 
ctacaagctg 
cctgagcggc 
cgacgtcttc 
cgccgaggct 
gagtcccaga 
cccgcccact 
tctgtggttt 
tttggtgggg 
ccaagcttgg 
ccccggccct 

ggccctccct 
ggctctgcca 
ctgaggatga 
t^gtgctggg 

g-^gggctggt 

agagccccag 
a aaaaaaaaa 
aaaaaaaaaa 



tggcaggtcc 
tgtgaggcca 
gagctgcaga 
gagaaggtgt 
tccccagagg 
ttcatgggca 
atcggccgct 
gcccgcctct 
gcaggaggcc 
gtcctgggcg 
tcccctccct 
gtagagctga 
cagccgagct 
accagcgccc 

tggaaacctg 
ccctacctgt 
gctggaagga 
actgagccag 
gccagccttc 
ctgacatgga 
aaaaaaaaaa 
aaaaa 22 0^ 



acctggacca 
ttgtggacac 
aggccatcgg 
ccaccctgcc 
actacacgct 
tggacatccc 
actacactgt 
agttcccaag 
cctggcccag 
ccctggaagc 
gggttcagaa 
tccagagcac 
cgtgtatcct 
gctgggctga 

ccctgcctga 
tcagtgtccc 
gtgagagggg 
tcccaggggc 
ctctgcagct 
aatacagttg 
aaaaaaaaaa 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
1860 
1920 
1980 
2040 
2100 
2160 



<210> 84 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_001909 
<400> 84 

tctgtttggt gggggtagag ctgatccaga gcacagatct gtttcgtgca ttggaagacc 



60 



<210> 85 

<211> 817 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002038 



<400> 85 

gaaccgttta 

cttctctcct 

cggtatcgct 

agaaaaagtg 

tggccgtcgg 

gcatcgcggc 

gcggcgtgcc 

gcgtcgtcat 

gtgaggagga 

tcttccagtt 

gttctcacta 

actgcagcct 

tacaagcatg 

cctagatgtg 



ctcgctgctg 
ccaaggtcta 
tttcttgtgc 
ctcggagagc 
aggaggactc 
caactcggtg 
cgccgggggg 
aggtaatatt 
tgaggagtag 
aggatctaga 
tattgtccag 
ccaactccta 
cgccgacgat 
aaaacagaat 



tgcccatcta 
gtgacggagc 
tacctgctgc 
tcggacagcg 
gcagtcgccg 
gctgcctcgc 
ctagtggcca 
ggtgccctga 
ccagcagctc 
actttgcctt 
gctagagtgc 
gcctcaagtg 
gcccagaatc 
aaacttcacc 



tcagcaggct ccgggctgaa gattgcttct 6 0 

ocgcgcgcgg cgccaccatg cggcagaagg 120 

tcttcacttg cagtggggtg gaggcaggta 180 

Q-ctccgggtt ctggaaggcc ctgaccttca 240 

cjgctgcccgc gctgggcttc accggcgccg 3 00 

tgatgagctg gtctgcgatc ctgaatgggg 3 60 

cgctgcagag cctcggggct ggtggcagca 42 0 

tgggctacgc cacccacaag tatctcgata 480 

ccagaacctc ttcttccttc ttggcctaac 540 

tttttttttt tttttttttt tttgagatgg 600 

ctgtggctatt cacagatgcg aacatagtac 660 

atcctcctgt ctcaacctcc caagtaggat 72 0 

cagaactttg tctatcactc tccccaacaa 7 80 
cagaaaa 817 



<210> 86 
<211> 60 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002038 
<400> 86 

agctcccaga acctcttctt ccttcttggc ctaactcttc cagttaggat ctagaacttt 60 

<210> 87 
<211> 1283 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_002046 
<400> 87 

ctctctgctc ctcctgttcg acagtcagcc gcatcttctt ttgcgtcgcc agccgagcca 60 

catcgctcag acaccatggg gaaggtgaag gtcggagtca acggatttgg tcgtattggg 12 0 

cgcctggtca ccagggctgc ttttaactct ggtaaagtgg atattgttgc catcaatgac 180 

cccttcattg acctcaacta catggtttSLC atgttccaat atgattccac ccatggcaaa 240 

ttccatggca ccgtcaaggc tgagaacgg-g- aagcttgtca tcaatggaaa tcccatcacc 3 00 

atcttccagg agcgagatcc ctccaaaatc aagtggggcg atgctggcgc tgagtacgtc 3 60 

gtggagtcca ctggcgtctt caccaccatg gagaaggctg gggctcattt gcagggggga 42 0 

gccaaaaggg tcatcatctc tgccccctct gctgatgccc ccatgttcgt catgggtgtg 480 

aaccatgaga agtatgacaa cagcctcaag atcatcagca atgcctcctg caccaccaac 540 

tgcttagcac ccctggccaa ggtcatcca_t gacaactttg gtatcgtgga aggactcatg 6 00 

accacagtcc atgccatcac tgccacccag aagactgtgg atggcccctc cgggaaactg 660 

tggcgtgatg gccgcggggc tctccagaac atcatccctg cctctactgg cgctgccaag 72 0 

gctgtgggca aggtcatccc tgagctgaac gggaagctca ctggcatggc cttccgtgtc 7 80 

cccactgcca acgtgtcagt ggtggacctg acctgccgtc tagaaaaacc tgccaaatat 840 

gatgacatca agaaggtggt gaagcaggcg tcggagggcc ccctcaaggg catcctgggc 900 

tacactgagc accaggtggt ctcctctgac ttcaacagcg acacccactc ctccaccttt 960 

gacgctgggg ctggcattgc cctcaacgac cactttgtca agctcatttc ctggtatgac 102 0 

aacgaatttg gctacagcaa cagggtggtg gacctcatgg cccacatggc ctccaaggag 1080 

taagacccct ggaccaccag ccccagcaag agcacaagag gaagagagag accctcactg 1140 

ctggggagtc cctgccacac tcagtccccc accacactga atctcccctc ctcacagttg 12 00 

ccatgtagac cccttgaaga ggggagggg-c ctagggagcc gcaccttgtc atgtaccatc 1260 
aataaagtac cctgtgctca acc 12 83 

<210> 88 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002046 
<400> 88 

ctcaacgacc actttgtcaa gc teat t toe tggtatgaca acgaatttgg ctacagcaac 60 

<210> 89 

<211> 1610 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJD02061 
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<400> 89 

ggcacgaggc tgcggccgca gtagccgga_g- ccggagccgc agccaccggt gccttccttt 6 0 

cccgccgccg cccagccgcc gtccggcctc cctcgggccc gagcgcagac caggctccag 12 0 

ccgcgcggcg ccggcagcct cgcgctccct ctcgggtctc tctcgggcct cgggcaccgc 180 

gtcctgtggg cggccgcctg cctgcccgcc cgcccgcagc cccttgcctg ccggcccctg 2 40 

ggcggcccgt gccatgggca ccgacagccg cgcggccaag gcgctcctgg cgcgggcccg 3 00 

caccctgcac ctgcagacgg ggaacctgct gaactggggc cgcctgcgga agaagtgccc 3 60 

gtccacgcac agcgaggagc ttcatgattg tatccaaaaa accttgaatg aatggagttc 42 0 

ccaaatcaac ccagatttgg tcagggagtt tccagatgtc ttggaatgca ctgtatctca 480 

tgcagtagaa aagataaatc ctgatgaaag agaagaaatg aaagtttctg caaaactgtt 540 

cattgtagaa tcaaactctt catcatcaac tagaagtgca gttgacatgg cctgttcagt 600 

ccttggagtt gcacagctgg attctgtgat cattgcttca cctcctattg aagatggagt 660 

taatctttcc ttggagcatt tacagcctta ctgggaggaa ttagaaaact tagttcagag 72 0 

caaaaagatt gttgccatag gtacctctg-a tctagacaaa acacagttgg aacagctgta 7 80 

tcagtgggca caggtaaaac caaatagtsta ccaagttaat cttgcctcct gctgtgtgat 840 

gccaccagat ttgactgcat ttgctaaaca atttgacata cagctgttga ctcacaatga 900 

tccaaaagaa ctgctttctg aagcaagttt ccaagaagct cttcaggaaa gcattcctga 960 

cattcaagcg cacgagtggg tgccgctgtg gctactgcgg tattcggtca ttgtgaaaag 1020 

tagaggaatt atcaaatcaa aaggctacat tttacaagct aaaagaaggg gttcttaact 1080 

gacttaggag cataacttac ctgtaatttc cttcaatatg agagaaaatt gagatgtgta 1140 

aaatctagtt actgcctgta aatggtgtca ttgaggcaga tattctttcg tcatatttga 12 00 

cagtatgttg tctgtcaagt tttaaatact tatcttgcct ccatatcaat ccattctcat 1260 

gaacctctgt attgctttcc ttaaactatt gttttctaat tgaaattgtc tataaagaaa 132 0 

atacttgcaa tatatttttc ctttattttt atgactaata taaatcaaga aaatttgttg 1380 

ttagatatat tttggcctag gtatcaggg-t aatgtatata catatttttt atttccaaaa 1440 

aaaattcatt aattgcttct taactcttat tataaccaag caatttaatt acaattgtta 1500 

aaactgaaat actggaagaa gatatttttc ctgtcattga tgagatatat cagagtaact 1560 
ggagtagctg ggatttacta gtagtgtaaa taaaattcac tcttcaatac 1610 

<210> 90 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_J)02061 
<400> 90 

ctgacttagg agcataactt acctgtaatt tccttcaata tgagagaaaa ttgagatgtg 60 

<210> 91 
<211> 873 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_002106 
<400> 91 

cgcagtttga atcgcggtgc gacgaaggag taggtggtgg gatctcaccg tgggtccgat 60 

tagccttttc tctgccttgc ttgcttgagc ttcagcggaa ttcgaaatgg ctggcggtaa 120 

ggctggaaag gactccggaa aggccaagac aaaggcggtt tcccgctcgc agagagccgg 180 

cttgcagttc ccagtgggcc gtattcatcg acacctaaaa tctaggacga ccagtcatgg 240 

acgtgtgggc gcgactgccg ctgtgtacaig cgcagccatc ctggagtacc tcaccgcaga 3 00 

ggtacttgaa ctggcaggaa atgcatcaaa agacttaaag gtaaagcgta ttacccctcg 3 60 

tcacttgcaa cttgctattc gtggagat^a agaattggat tctctcatca aggctacaat 42 0 

tgctggtggt ggtgtcattc cacacatcca caaatctctg attgggaaga aaggacaaca 480 

gaagactgtc taaaggatgc ctggattcct tgttatctca ggactctaaa tactctaaca 540 

gctgtccagt gttggtgatt ccagtgga.ct gtatctctgt gaaaaacaca attttgcctt 600 

tttgtaattc tatttgagca agttggaa_^t ttaattagct ttccaaccaa ccaaatttct 660 
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gcattcgagt cttaaccata tttaagtgtt 
aagtagtggg ttttgattga gttgactg*tt 
tgcagaagtt atagtaacaa acatttgg-tt 
taagttcaat aaaggtcata tcccaaacta 



actgtggctt caaagaagct attgattctg 72 0 

tttaaaaaac tgtttggatt ttaattgtga 780 

ttgtacagac attatttcca ctctggtgga 840 
aaa 873 



<210> 92 
<211> 60 
<212> UNA 

<213> Homo sapiens 



<300> 

<308> NM„002106 



<400> 92 

cgagtcttaa ccatatttaa gtgttactgt ggcttcaaag aagctattga ttctgaagta 60 

<210> 93 
<211> 4204 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_002205 



<400> 93 

caggacaggg 

tgcagctgcg 

tgccgccgcc 

cggggccccc 

gggtcagtgt 

gtggtgctgt 

ttgacagcaa 

ctgtggagta 

tcttggcatg 

tgggcacctg 

gctcagattt 

tcaccaagac 

tcctgtctgc 

tggttcaggg 

gatactctgt 

tgcccaaagg 

ccctctacaa 

cagacgtcaa 

ggacccctga 

ccggcataga 

gcagctcctt 

gggctccctt 

gagggctggg 

cagacttctt 

atctgattgt 

tgtccgctag 

gcagcttaga 

gaaaacacgt 

agaagggagg 

ccctgctcat 

acgagtcaga 

acccccaagc 

gccggataga 

ctgacctgca 

ccctgaacct 

ttcgggtcac 



aagagcgggc 
ctggggcccc 
acccagggtc 
gggctccttc 
gctggtggga 
ctacctctgt 
aggctctcgg 
caagtccttg 
cgctccactg 
ctacctctcc 
cagctgggca 
tggccgtgtg 
cactcaggag 
gcagctgcag 
ggctgttggt 
gaacctcact 
cttctcaggg 
tggggacggg 
cgggcggcct 
gcccacgccc 
gacccccctg 
tggtggggag 
ctctaagcct 
tggctctgcc 
ggggtccttt 
tgcctccctc 
ggggaaccct 
tgctgactcc 
ggtacggcgg 
ccagaatggg 
atttcgagac 
cccagtggac 
ggacaaggct 
gctggaagtg 
cactttccat 
cgcccctcca 



gctatggg-ga 
cggcgccgac 
gggggcttca 
ttcggattct 
gcacccaagg 
ccttgggg-tg 
ctcctggsLg-t 
cagtggtt eg 
tacagctggc 
acagataact 
gcaggaca-^g 
gttttagg-tg 
cagattgeag 
actcgccagg 
gaattcag-tg 
tacggcta_tg 
gaacagatgg 
ctggatga_ct 
caggaggtig-g 
acccttaccc 
ggggacctgg 
acccagcagg 
tcccaggttc 
cttcgagg-a.g 
ggtgtgga.ca 
accatcttcc 
gtggcctg-ca 
attggttt ca 
gcactgttcc 
gctcgaga_gg 
aaactctcgc 
agccacggcc 
cagatcttgc 
tttgggga.gc 
geccagaatg 
gaggctga_gt 



gccggacgcc 
ccccgctcgt 
acttagaege 
cagtggagtt 
ctaataccag 
ccagccccac 
cctcactgtc 
gggcaacagt 
gcacagagaa 
tcacccgaat 
gttactgeca 
gaccaggaag 
aatcttatta 
ccagttccat 
gtgatgacac 
tcaccatcct 
cctcctactt 
tgctggtggg 
gcagggtcta 
tcactggcca 
accaggatgg 
gagtagtgtt 
tgcagcccct 
gccgagacct 
aggctgtggt 
ccgccatgtt 
tcaaccttag 
cagtggaact 
tggcctccag 
attgeagaga 
cgattcacat 
tcaggccagc 
tggactgtgg 
agaaccatgt 
tgggtgaggg 
actcaggact 



agagtcccct 
gccgctgctg 
ggaggcccca 
ttaccggccg 
ccagccagga 
acagtgcacc 
cagctcagag 
tcgagcccat 
ggagccactg 
tctggagtat 
aggaggcttc 
ctatttctgg 
ccccgagtac 
ctatgatgac 
agaagacttt 
taatggctca 
tggctatgea 
ggcacccctg 
cgtctacctg 
tgatgagttt 
ctacaatgat 
tgtatttcct 
gtgggcagcc 
ggatggcaat 
atacaggggc 
caacccagag 
cttctgcctc 
tcagctggac 
gcaggcaacc 
gatgaagatc 
cgctctcaac 
cctacattat 
agaagacaac 
gtacctgggt 
tggegectat 
cgtcagacac 



ctccacgccg 
ttgetgeteg 
gcagtactct 
ggaacagacg 
gtgetgeagg 
cccattgaat 
ggagaggagc 
ggctcctcca 
agcgaccccg 
gcaccctgcc 
agtgccgagt 
caaggecaga 
ctgatcaacc 
agctacctag 
gttgctggtg 
gacattcgat 
gtggccgcca 
ctcatggatc 
cagcacccag 
ggccgatttg 
gtggccatcg 
gggggcccag 
agccacaccc 
ggatatcctg 
cgccccatcg 
gageggaget 
aatgcttctg 
tggcagaagc 
ctgacccaga 
tacctcagga 
ttctccttgg 
cagagcaaga 
atctgtgtgc 
gacaagaatg 
gaggctgagc 
ccagggaact 



60 

12 0 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 
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tctccagcct gagctgtgac tactttgccg tgaaccagag ccgcctgctg gtgtgtgacc 222 0 

tgggcaaccc catgaaggca ggagccagtc tgtggggtgg ccttcggttt acagtccctc 2280 

atctccggga cactaagaaa accatccagt ttgacttcca gatcctcagc aagaatctca 2340 

acaactcgca aagcgacgtg gtttcctttc ggctctccgt ggaggctcag gcccaggtca 2400 

ccctgaacgg tgtctccaag cctgaggca_g- tgctattccc agtaagcgac tggcatcccc 246 0 

gagaccagcc tcagaaggag gaggacctg-g- gacctgctgt ccaccatgtc tatgagctca 252 0 

tcaaccaagg ccccagctcc attagccag-g gtgtgctgga actcagctgt ccccaggctc 2580 

tggaaggtca gcagctccta tatgtgacca gagttacggg actcaactgc accaccaatc 2 640 

accccattaa cccaaagggc ctggagttg-g- atcccgaggg ttccctgcac caccagcaaa 27 0 0 

aacgggaagc tccaagccgc agctctgctt cctcgggacc tcagatcctg aaatgcccgg 27 6 0 

aggctgagtg tttcaggctg cgctgtgag-c tcgggcccct gcaccaacaa gagagccaaa 282 0 

gtctgcagtt gcatttccga gtctgggcca agactttctt gcagcgggag caccagccat 2 880 

ttagcctgca gtgtgaggct gtgtacaaa_cj ccctgaagat gccctaccga atcctgcctc 294 0 

ggcagctgcc ccaaaaagag cgtcaggto/g- ccacagctgt gcaatggacc aaggcagaag 3 00 0 

gcagctatgg cgtcccactg tggatcatca tcctagccat cctgtttggc ctcctgctcc 3 06 0 

taggtctact catctacatc ctctacaagc ttggattctt caaacgctcc ctcccatatg 312 0 

gcaccgccat ggaaaaagct cagctcaagc ctccagccac ctctgatgcc tgagtcctcc 3180 

caatttcaga ctcccattcc tgaagaacca gtccccccac cctcattcta ctgaaaagga 324 0 

ggggtctggg tacttcttga aggtgctga_c ggccagggag aagctcctct ccccagccca 330 0 

gagacatact tgaagggcca gagccagggg ggtgaggagc tggggatccc tcccccccat 33 6 0 

gcactgtgaa ggacccttgt ttacacatac cctcttcatg gatgggggaa ctcagatcca 342 0 

gggacagagg cccagcctcc ctgaagcctt tgcattttgg agagtttcct gaaacaactg 3 48 0 

gaaagataac taggaaatcc attcacagtt ctttgggcca gacatgccac aaggacttcc 3 54 0 

tgtccagctc caacctgcaa agatctgtcc tcagccttgc cagagatcca aaagaagccc 3 60 0 

ccagtaagaa cctggaactt ggggagttcLa. gacctggcag ctctggacag ccccaccctg 3 66 0 

gtgggccaac aaagaacact aactatgcat ggtgccccag gaccagctca ggacagatgc 372 0 

cacaaggata gatgctggcc cagggccaga gcccagctcc aaggggaatc agaactcaaa 378 0 

tggggccaga tccagcctgg ggtctggacjt tgatctggaa cccagactca gacattggca 3 84 0 

ccaatccagg cagatccagg actatatttg ggcctgctcc agacctgatc ctggaggccc 390 0 

agttcaccct gatttaggag aagccaggaa tttcccagga cctgaagggg ccatgatggc 396 0 

aacagatctg gaacctcagc ctggccagac acaggccctc cctgttcccc agagaaaggg 402 0 

gagcccactg tcctgggcct gcagaatttg ggttctgcct gccagctgca ctgatgctgc 408 0 

ccctcatctc tctgcccaac ccttccctca ccttggcacc agacacccag gacttattta 414 0 

aactctgttg caagtgcaat aaatctgacc cagtgccccc actgaccaga actagaaaaa 42 0 0 
aaaa 42 04 

<210> 94 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002205 
<400> 94 

ttggcaccag acacccagga cttatttaaa ctctgttgca agtgcaataa atctgaccca 60 

<210> 95 

<211> 1976 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_J)02266 
<400> 95 

gccacacggt ctttgagctg agtcgaggtg gaccctttga acgcagtcgc cctacagccg 60 

ctgattcccc ccgcatcgcc tcccgtggata gcccaggccc gcttcgcagc tttctccctt 120 

tgtctcataa ccatgtccac caacgagaat gctaatacac cagctgcccg tcttcacaga 180 

ttcaagaaca agggaaaaga cagtacagaa atgaggcgtc gcagaataga ggtcaatgtg 240 
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gagctgagga aagctaagaa ggatgaccag atgctgaaga ggagaaatgt aagctcattt 30 0 

cctgatgatg ctacttctcc gctgcag^aa aaccgcaaca accagggcac tgtaaattgg 36 0 

tctgttgatg acattgtcaa aggcataaat agcagcaatg tggaaaatca gctccaagct 42 0 

actcaagctg ccaggaaact actttccaga gaaaaacagc cccccataga caacataatc 48 0 

cgggctggtt tgattccgaa atttgtgtcc ttcttgggca gaactgattg tagtcccatt 54 0 

cagtttgaat ctgcttgggc actcactaac attgcttctg ggacatcaga acaaaccaag 60 0 

gctgtggtag atggaggtgc catcccagca ttcatttctc tgttggcatc tccccatgct 66 0 

cacatcagtg aacaagctgt ctgggctcta ggaaacattg caggtgatgg ctcagtgttc 72 0 

cgagacttgg ttattaagta cggtgca^tt gacccactgt tggctctcct tgcagttcct 780 

gatatgtcat ctttagcatg tggctactta cgtaatctta cctggacact ttctaatctt 840 

tgccgcaaca agaatcctgc acccccgata gatgctgttg agcagattct tcctacctta 90 0 

gttcggctcc tgcatcatga tgatccagaa gtgttagcag atacctgctg ggctatttcc 96 0 

taccttactg atggtccaaa tgaacgaeitt ggcatggtgg tgaaaacagg agttgtgccc 102 O 

caacttgtga agcttctagg agcttctgaa ttgccaattg tgactcctgc cctaagagcc 108 O 

atagggaata ttgtcactgg tacagatgaa cagactcagg ttgtgattga tgcaggagca 114 O 

ctcgccgtct ttcccagcct gctcaccaac cccaaaacta acattcagaa ggaagctacg 12 00 

tggacaatgt caaacatcac agccggccgc caggaccaga tacagcaagt tgtgaatcat 12 6 O 

ggattagtcc cattccttgt cagtgttctc tctaaggcag attttaagac acaaaaggaa 132 O 

gctgtgtggg ccgtgaccaa ctataccagt ggtggaacag ttgaacagat tgtgtacctt 13 8 O 

gttcactgtg gcataataga accgttg-a.tg aacctcttaa ctgcaaaaga taccaagatt 144 O 

attctggtta tcctggatgc catttcaaat atctttcagg ctgctgagaa actaggtgaa 150 O 

actgagaaac ttagtataat gattgaagaa tgtggaggct tagacaaaat tgaagctcta 15 6 O 

caaaaccatg aaaatgagtc tgtgtataag gcttcgttaa gcttaattga gaagtatttc 162 O 

tctgtagagg aagaggaaga tcaaaacgtt gtaccagaaa ctacctctga aggctacact 168 O 

ttccaagttc aggatggggc tcctgggacc tttaactttt agatcatgta gctgagacat 174 O 

aaatttgttg tgtactacgt ttggtatttt gtcttattgt ttctctacta agaactcttt 180 O 

cttaaatgtg gtttgttact gtagcacttt ttacactgaa actatacttg aacagttcca I86 0 

actgtacata catactgtat gaagcttgtc ctctgactag gtttctaatt tctatgtgga 192 O 
atttcctatc ttgcagcatc ctgtaaataa acattcaagt ccacccttaa aaaaaa 1976 

<210> 96 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002266 
<400> 96 

tgagtctgtg tataaggctt cgttaagrctt aattgagaag tatttctctg tagaggaaga 6 0 

<210> 97 

<211> 1145 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002346 
<400> 97 

gctccggcca gccgcggtcc agagcgcgcg aggttcgggg agctccgcca ggctgctggt 6 0 

acctgcgtcc gcccggcgag caggacaggc tgctttggtt tgtgacctcc aggcaggacg 12 O 

gccatcctct ccagaatgaa gatcttcttg ccagtgctgc tggctgccct tctgggtgtg 18D 

gagcgagcca gctcgctgat gtgcttctcc tgcttgaacc agaagagcaa tctgtactgc 2 4D 

ctgaagccga ccatctgctc cgaccaggac aactactgcg tgactgtgtc tgctagtgcc 3 0O 

ggcattggga atctcgtgac atttggccac agcctgagca agacctgttc cccggcctgc 3 60 

cccatcccag aaggcgtcaa tgttggtgtg gcttccatgg gcatcagctg ctgccagagc 42 O 

tttctgtgca atttcagtgc ggccgatggc gggctgcggg caagcgtcac cctgctgggt 4 80 

gccgggctgc tgctgagcct gctgcc^gcc ctgctgcggt ttggcccctg accgcccaga 5 40 

ccctgtcccc cgatccccca gctcagg-aag gaaagcccag ccctttctgg atcccacagt 6 0O 

gtatgggagc ccctgactcc tcacgtgcct gatctgtgcc cttggtccca ggtcaggccc 6 6(3 
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accccctgca cctccacctg ccccagcccc 
acttctgggg tggatgatgt gaccttcctt 
ggagtcttac ggtccaacat cagaccaagt 
agaccgtgtc agtagggatg tgtgcctggc 
gcacgtggcg gcttctgggg gccatgtttg 
cctcagtccc tgtagccccc tgccctggca 
ggggttgggg tttctgccac ttccgggtct 
ccagcccacc cccacattgg agccctcctg 
tcccc 1145 



tgcctctgcc caagtgggcc agctgccctc 72 0 

gggggactgc ggaagggacg agggttccct 780 

cccatggaca tgctgacagg gtccccaggg 840 

tgtgtacgtg ggtgtgcagt gcacgtgaga 900 

gggagggagg tgtgccagca gcctggagag 960 

cagctgcatg cacttcaagg gcagcctttg 102 0 

aggccctgcc caaatccagc cagtcctgcc 108 0 

ctgctttggt gcctcaaata aatacagatg 114 0 



<210> 98 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_002346 



<400> 98 

ggttccctgg agtcttacgg tccaacatca gaccaagtcc catggacatg ctgacagggt 60 

<210> 99 

<211> 1390 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_0023 5 8 



<400> 99 

gggaagtgct 

ttgtgtccct 

gcgccgaaat 

gcatatatcc 

ctactgatct 

tatacaagtg 

tcctggaaag 

ccagagaaaa 

cagctacggt 

atacagacaa 

ccaattctga 

tggtggccta 

taattttgaa 

ttggttaatt 

cctttatttt 

cattgttcaa 

gatagtaact 

gttttggtca 

aaaggaagtc 

aacaatgaaa 

ttgaatcagt 

atatttgtac 

ttataaaatc 

aaaaaaaaaa 



gttggagccg 
ggccatggcg 
cgtggccgag 
atctgaaacc 
tgagctcata 
ttcagttcag 
atggcagttt 
gtctcagaaa 
gacatttctg 
agatttggtt 
ggaagtccgc 
caaaattcct 
atgtggtttt 
tttacatgga 
tttggtacct 
aaggaaccag 
gtagatggaa 
agtagtttga 
taaatattca 
tattgctgta 
ttccaattat 
tgtttaatgt 
aagttttaag 
1390 



ctgtg-g-ttgc 
ctgcagctct 
ttcttctcat 
tttactcgag 
aaatacctaa 
aaacfcggttg 
gatattgagt 
gctatccagg 
ccactgttgg 
gtacctgaaa 
cttccjttcat 
gtcaatgact 
cctgaaatca 
gaaaaccaaa 
atttgactta 
gaggfcttttt 
aaacttgtgc 
ctcagtatag 
gaatctttgt 
tagctccttt 
ttgactttaa 
tctgtgatac 
tgaaagtgag 



tgtccgcgga 
cccgggagca 
tcggcatcaa 
tgcagaaata 
ataatgtggt 
tagttatctc 
gtgacaagac 
atgaaatccg 
aagtttcttg 
aatgggaaga 
ttactactac 
gaggatgaca 
ggtcatctat 
atgatactta 
ccatggagtt 
tgtcaacatt 
tataaagcta 
gtagggagat 
taaggtcctg 
tgaccttcat 
tttatgtaac 
agaactctta 
gaaataaagt 



gtggaagcgc 
gggaatcacc 
cagcatttta 
cggactcacc 
ggaacaactg 
aaatattgaa 
tgcaaaagat 
ttcagtgatc 
ttcatttgat 
gtcgggacca 
aatccacaaa 
tgaggaaaat 
agttgatatg 
ctgaactgtg 
aacatcatga 
gtgatgtata 
gatgctttcc 
atttaagtat 
aaagtaactc 
ttcatgtata 
ttgaacctat 
aaaatgtttt 
taagtttgtt 



gtgcttttgt 
ctgcgcggga 
tatcagcgtg 
ttgcttgtaa 
aaagattggt 
agtggtgagg 
gacagtgcac 
agacagatca 
ctgctgattt 
cagtttatta 
gtaaatagca 
aatgtaattg 
ttttatttca 
tgtaattgtt 
atttattgca 
ttcctttgaa 
taaatcagat 
aaaatacaac 
ataatctata 
gttttcccta 
gaagcaatgg 
ttcatgtgtt 
ttaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

12O0 

1260 

1320 

1380 



<210> 100 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 
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<308> NM_0Q23 5 8 
<400> 100 

atgctttcct aaatcagatg ttttggtcaa gtagtttgac tcagtatagg tagggagata 60 

<210> 101 

<211> 1821 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002422 
<400> 101 

acaaggaggc aggcaagaca gcaaggcata gagacaacat agagctaagt aaagccagtg 6 0 

gaaatgaaga gtcttccaat cctact^ttg ctgtgcgtgg cagtttgctc agcctatcca 12 0 

ttggatggag ctgcaagggg tgaggacacc agcatgaacc ttgttcagaa atatctagaa 180 

aactactacg acctcaaaaa agatgtgaaa cagtttgtta ggagaaagga cagtggtcct 2 40 

gttgttaaaa aaatccgaga aatgcagaag ttccttggat tggaggtgac ggggaagctg 3 00 

gactccgaca ctctggaggt gatgcgcaag cccaggtgtg gagttcctga tgttggtcac 3 60 

ttcagaacct ttcctggcat cccgaagtgg aggaaaaccc accttacata caggattgtg 42 0 

aattatacac cagatttgcc aaaagatgct gttgattctg ctgttgagaa agctctgaaa 480 

gtctgggaag aggtgactcc actcacattc tccaggctgt atgaaggaga ggctgatata 540 

atgatctctt ttgcagttag agaacatgga gacttttacc cttttgatgg acctggaaat 600 

gttttggccc atgcctatgc ccctgggcca gggattaatg gagatgccca ctttgatgat 6 60 

gatgaacaat ggacaaagga tacaacaggg accaatttat ttctcgttgc tgctcatgaa 720 

attggccact ccctgggtct ctttcactca gccaacactg aagctttgat gtacccactc 7 80 

tatcactcac tcacagacct gactcg-gttc cgcctgtctc aagatgatat aaatggcatt 840 

cagtccctct atggacctcc ccctgactcc cctgagaccc ccctggtacc cacggaacct 900 

gtccctccag aacctgggac gccagccaac tgtgatcctg ctttgtcctt tgatgctgtc 9 60 

agcactctga ggggagaaat cctgatcttt aaagacaggc acttttggcg caaatccctc 1020 

aggaagcttg aacctgaatt gcatttgatc tcttcatttt ggccatctct tccttcaggc 1080 

gtggatgccg catatgaagt tactagcaag gacctcgttt tcatttttaa aggaaatcaa 1140 

ttctgggcca tcagaggaaa tgaggtacga gctggatacc caagaggcat ccacacccta 12 0 0 

ggtttccctc caaccgtgag gaaaatcgat gcagccattt ctgataagga aaagaacaaa 12 6 0 

acatatttct ttgtagagga caaatactgg agatttgatg agaagagaaa ttccatggag 132 0 

ccaggctttc ccaagcaaat agctgaagac tttccaggga ttgactcaaa gattgatgct 13 80 

gtttttgaag aatttgggtt cttttatttc tttactggat cttcacagtt ggagtttgac 1440 

ccaaatgcaa agaaagtgac acacactttg aagagtaaca gctggcttaa ttgttgaaag 150 0 

agatatgtag aaggcacaat atgggcactt taaatgaagc taataattct tcacctaagt 156 0 

ctctgtgaat tgaaatgttc gttttctcct gcctgtgctg tgactcgagt cacactcaag 1620 

ggaacttgag cgtgaatctg tatcttgccg gtcattttta tgttattaca gggcattcaa 16 80 

atgggctgct gcttagcttg caccttgtca catagagtga tctttcccaa gagaagggga 1740 

agcactcgtg tgcaacagac aagtgactgt atctgtgtag actatttgct tatttaataa 1800 
agacgatttg tcagttgttt t 182 1 

<210> 102 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002422 
<400> 102 

tgtagaaggc acaatatggg cactttaaat gaagctaata attcttcacc taagtctctg 60 

<210> 103 
<211> 2787 
<212> DNA 



48 



WO 2005/086891 



PCT/US2005/007894 



<213> Homo sapiens 
<300> 

<308> NM_002462 



<400> 103 

agagcggagg 

ggaatcccag 

cagctcaggg 

gctacaggaa 

attccaagga 

tatttgaagg 

gtggacatcg 

gctactgtgg 

gaggagaagg 

caggacctgg 

gtgttggagg 

ccgctggtgc 

taccaggact 

gcccagaatg 

atcagctccc 

gctgtgggca 

atccagaggc 

acagaggctc 

ttgacgaagc 

aacctcgtgt 

atccaggacc 

cacccatatt 

aaacttacca 

atcaaggaga 

gaagacgaaa 

atcactgctc 

agactccgac 

cataaaattt 

ccaggctttg 

gaagagccgg 

gatgtttcga 

attgaagaca 

cagatggaac 

agagagaagg 

tccagctcgg 

caggaggcca 

cagacgtacg 

tacagctggc 

cggcttgcac 

tctgtccagc 

gactgacgac 

ccgtggtgat 

ccaccctcag 

ggattttcag 

aactgacaca 

cccagtgttt 

tctctgtaat 



ccgcactcca 
tgtcacggtg 
gctttggaat 
gactcccact 
ggtgcaggag 
aacgtatatt 
caaaagctga 
cccagaaaaa 
tgcgcccctg 
ccctgccagc 
cactgtcagg 
tgaaactgaa 
acgagattga 
ccatcgccgg 
gagatgtccc 
atcagcctgc 
aggagacaat 
tcagcatggc 
ctgatctggt 
tccacctgaa 
agctgagcct 
tcagggatct 
gcgagctcat 
ctcaccagag 
atgaaaaaat 
tcatgcaagg 
acgagttcca 
tgagtagaaa 
tgaattacag 
ctgtggatat 
taaaaaattt 
ttagagcaga 
agattgtcta 
agctggaaga 
caacagactc 
gcaagcgcat 
gccagcagct 
tcctgaagga 
ggctgacgca 
cccgtagacg 
ttgagtgctc 
ttagcaggaa 
atgcacatga 
ccctcagaat 
tgctgaacat 
taggagcatg 
aaactcattt 



gcactgcgca gggaccgcct 

gacacgcctc cctcgcgccc 

tctgtgcjcca cactgcgagg 

ccctgaaatc tggagtgaag 

aacagctctg tgataccatt 

agagcttact ttgcaaagaa 

tccagctgct gcatcccacc 

tccaggctcg gtggctgaga 

catcgacctc attgactccc 

catcgccgtc at cggggacc 

agttgccctt cccagaggca 

gaaacttgtg aacgaagata 

gatttcg-gat gcttcagagg 

ggaaggaatg ggaatcagtc 

ggatctcjact ctaatagacc 

tgacattggg tataagatca 

cagcctcjgtg gtggtcccca 

ccaggaggtg gaccccgagg 

ggacaaagga actgaagaca 

gaagggttac atgattgtca 

gtccgaagcc ctgcagagag 

gctggacjgaa ggaaaggcca 

cacacatatc tgtaaatctc 

aataacagag gagctacaaa 

gttcttcctg atagataaaa 

agaggaaact gtaggggagg 

caaatggagt acaataattg 

aatccagaaa tttgaaaatc 

gacatttgag acaatcgtga 

gctacaoa.cc gtgacggata 

tgaagacjttt tttaacctcc 

acaagagaga gaaggtgaga 

ctgccacjgac caggtataca 

agaaaagaag aagaaatcct 

ttccatcjgag gagatctttc 

ctccagccac atccctttga 

tcagaaggcc atgctgcagc 

gcggagcgac accagcgaca 

ggctcgcjcgc cggcttgccc 

tgcacgcaca ctgtctgccc 

agtagtcaga ctggatagtc 

gctgtgatgag cagtttggtt 

gctggccjgga ttgaaggatg 

cgctccacct tgcagctctc 

cacagcttat ttcctcattt 

agtgcccjtgt gtgtgcgtcc 
ctagcacj 27 87 



tggaccgcag 
ttgccgccca 
agatcggttc 
aacgccgcca 
taacttgttg 
ggaagatggt 
ctctattact 
acaacctgtg 
tgcgggctct 
agagctcggg 
gcgggatcgt 
agtggagagg 
tagaaaagga 
atgagctaat 
ttcctggcat 
agacactcat 
gtaatgtgga 
gagacaggac 
aggttgtgga 
agtgccgggg 
agaagatctt 
cggttccctg 
tgcccctgtt 
agtatggtgt 
ttaatgcctt 
aagacattcg 
aaaacaattt 
agtatcgtgg 
aacagcaaat 
tggtccggct 
acagaaccgc 
agctgatccg 

ggggtgcatt 

gggattttgg 
agcacctgat 
tcatccagtt 
tcctgcagga 
agcggaagtt 
agttccccgg 
ccgttcccgg 
cgtctctgct 
tctagcatga 
ctgtcttcgt 
cccttctctg 
ttataatgtc 
tgtcggagcc 



ttgccggcca 
cctgctcacc 
tgggtcggag 
tccagccacc 
acattacttt 
tgtttccgaa 
gaatggagat 
cagccagtat 
aggtgtggag 
caagagctcc 
gaccagatgc 
caaggtcagt 
aattaataaa 
caccctggag 
aaccagagtg 
caagaagtac 
catcgccacc 
catcggaatc 
cgtggtgcgg 
ccagcaggag 
ctttgagaac 
cctggcagaa 
agaaaatcaa 
cgacataccg 
taatcaggac 
gctgtttacc 
tcaagaaggc 
tagagagctg 
caaggcactg 
tgctttcaca 
caagtccaaa 
cctccacttc 
gcagaaggtc 
ggctttccag 
ggcctatcac 
cttcatgctc 
caaggacacc 
cctgaaggag 
ttaaccacac 
gtagccactg 
tatccgttag 
agacagagcc 
actgggaaag 
tattcctaga 
ccttcacaaa 
ctgtctcctc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 



<210> 104 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM.002462 
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<400> 104 

actgacacat gctgaacatc acagcttatt tcctcatttt tataatgtcc cttcacaaac 6 0 

<210> 105 

<211> 2808 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJD02759 
<400> 105 

gcggcggcgg cggcgcagtt tgctcatact ttgtgacttg cggtcacagt ggcattcagc 6 0 

tccacacttg gtagaaccac aggcacgaca agcatagaaa catcctaaac aatcttcatc 12 O 

gaggcatcga ggtccatccc aataaaaatc aggagaccct ggctatcata gaccttagtc 180 

ttcgctggta tactcgctgt ctgtcaacca gcggttgact ttttttaagc cttctttttt 2 40 

ctcttttacc agtttctgga gcaaattcag tttgccttcc tggatttgta aattgtaatg 3 0O 

acctcaaaac tttagcagtt cttccatctg actcaggttt gcttctctgg cggtcttcag 3 60 

aatcaacatc cacacttccg tgattatctg cgtgcatttt ggacaaagct tccaaccagg 42 O 

atacgggaag aagaaatggc tggtgettctt tcagcaggtt tcttcatgga ggaacttaat 4 80 

acataccgtc agaagcaggg agtagtactt aaatatcaag aactgcctaa ttcaggacct 5 40 

ccacatgata ggaggtttac atttcaagtt ataatagatg gaagagaatt tccagaaggt 6 0O 

gaaggtagat caaagaagga agcaaaaaat gccgcagcca aattagctgt tgagatactt 6 60 

aataaggaaa agaaggcagt tagtccttta ttattgacaa caacgaattc ttcagaagga 720 

ttatccatgg ggaattacat aggccttatc aatagaattg cccagaagaa aagactaact 7 80 

gtaaattatg aacagtgtgc atcggg-ggtg catgggccag aaggatttca ttataaatgc 840 

aaaatgggac agaaagaata tagtattggt acaggttcta ctaaacagga agcaaaacaa 9 0O 

ttggccgcta aacttgcata tcttcaigata ttatcagaag aaacctcagt gaaatctgac 9 60 

tacctgtcct ctggttcttt tgctactacg tgtgagtccc aaagcaactc tttagtgacc 102 0 

agcacactcg cttctgaatc atcatctgaa ggtgacttct cagcagatac atcagagata 1080 

aattctaaca gtgacagttt aaacagttct tcgttgctta tgaatggtct cagaaataat 114=0 

caaaggaagg caaaaagatc tttggcaccc agatttgacc ttcctgacat gaaagaaaca 12 00 

aagtatactg tggacaagag gtttgg-catg gattttaaag aaatagaatt aattggctca 1260 

ggtggatttg gccaagtttt caaagcaaaa cacagaattg acggaaagac ttacgttatt 132 0 

aaacgtgtta aatataataa cgagaaggcg gagcgtgaag taaaagcatt ggcaaaactt 1330 

gatcatgtaa atattgttca ctacaatggc tgttgggatg gatttgatta tgatcctgag 144=0 

accagtgatg attctcttga gagcagtgat tatgatcctg agaacagcaa aaatagttca 15O0 

aggtcaaaga ctaagtgcct tttcatccaa atggaattct gtgataaagg gaccttggaa 156 0 

caatggattg aaaaaagaag aggcgagaaa ctagacaaag ttttggcttt ggaactcttt 162 0 

gaacaaataa caaaaggggt ggattatata cattcaaaaa aattaattca tagagatctt 168 0 

aagccaagta atatattctt agtagataca aaacaagtaa agattggaga ctttggactt 174 0 

gtaacatctc tgaaaaatga tggaaagcga acaaggagta agggaacttt gcgatacatg 18O0 

agcccagaac agatttcttc gcaagactat ggaaaggaag tggacctcta cgctttgggg 186 0 

ctaattcttg ctgaacttct tcatgtatgt gacactgctt ttgaaacatc aaagtttttc 192 0 

acagacctac gggatggcat catctcagat atatttgata aaaaagaaaa aactcttcta 1980 

cagaaattac tctcaaagaa acctgaggat cgacctaaca catctgaaat actaaggacc 2 04=0 

ttgactgtgt ggaagaaaag cccagagaaa aatgaacgac acacatgtta gagcccttct 21O0 

gaaaaagtat cctgcttctg atatgcagtt ttccttaaat tatctaaaat ctgctaggga 216 0 

atatcaatag atatttacct tttattttaa tgtttccttt aattttttac tatttttact 2 22 0 

aatctttctg cagaaacaga aaggttttct tctttttgct tcaaaaacat tcttacattt 2280 

tactttttcc tggctcatct ctttattctt tttttttttt ttaaagacag agtctcgctc 2340 

tgttgcccag gctggagtgc aatgacacag tcttggctca ctgcaacttc tgcctcttgg 2 4O0 

gttcaagtga ttctcctgcc tcagcctcct gagtagctgg attacaggca tgtgccaccc 2 46 0 

acccaactaa tttttgtgtt tttaataaag acagggtttc accatgttgg ccaggctggt 2 52 0 

ctcaaactcc tgacctcaag taatccacct gcctcggcct cccaaagtgc tgggattaca 2 580 

gggatgagcc accgcgccca gcctcatctc tttgttctaa agatggaaaa accaccccca 2 640 

aattttcttt ttatactatt aatgaatcaa tcaattcata tctatttatt aaatttctac 2 7O0 

cgcttttagg ccaaaaaaat gtaagatcgt tctctgcctc acatagctta caagccagct 2 76 0 
ggagaaatat ggtactcatt aaaaaaaaaa aaaaagtgat gtacaacc 2 808 
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<210> 106 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_002759 



<400> 106 

tcgttctctg cctcacatag cttacaagcc agctggagaa atatggtact cattaaaaaa 60 

<210> 107 

<211> 1678 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_002811 



<400> 107 

aagaaggagg 

accggtgacc 

gggtgtgtcg 

ctcagtgtgg 
ggtgtgcttt 
ccttttgatg 
aacatgtatg 
acaggcccta 
cctaattccg 
gcgtacattt 
cacgtgacca 
cgagatatca 
ggtttgaagg 
acaggcaagc 
ctgccagatg 
atggtggtag 
aacaacaaga 
aaaaaggata 
aaagaggaga 
ttaaaatctt 
tattagaaag 
gtggaagtga 
ggatatgata 
tattttggta 
aaagactctt 
gcattatgtt 
acaccatgtt 
ttgtttacct 



ccgcgcgagg 
gctactgctg 
cgatgccgga 
tggatcattt 
tggggtcatg 
aagatgacaa 
gaatgtttaa 
aactacacaa 
tattggtcat 
cagtggaaga 
gtgaaattgg 
aagacacgac 
gactgaactc 
tgcccatcaa 
tcagcctgca 
tgtacttggc 
ttgccaaccg 
ggaaagagga 
aaaaggagaa 
acaaactaaa 
ctgacccaac 
atggagactg 
gtcagctcag 
ctcttcattc 

ggggtctgtt 

ctctggtaac 
ctggacccat 
tcaaagatac 



gctgacgaac 
ccggtgtttg 
gctggcagtg 
caaccgaatc 
gcaaaagaaa 
agacgattct 
gaaagtcaat 
gaatgacatt 
cattgatgtg 
agtccatgat 
agcagaggaa 
gg"tgggcact 
caagcttctg 
ccaccagatc 
gcgagttcgtc 
ctcgctgatc 
ggatgcagag 
caaggagaaa 
aaagtaaaac 
toagtgtgct 
aagagctctc 
atctcaaatc 
gcttcagatt 
atttatctct 
totggtattt 
acgtagagtt 
agctctggca 
aattaaatgg 



cggaagaaga 
cgtgtggcag 
cagaaggtgg 
ggcaaggttg 
gtacttgatg 
gtatggtttt 
gccagggaaa 
gccatcaacg 
aagccgaagg 
gatggaactc 
gctgaggaag 
ctgtcccagc 
gatatcagga 
atctaccagc 
aaggcctttt 
cgttccgtgg 
aagaaagaag 
gataaagata 
atgtattaaa 
gctagagggt 
tgcctccggt 
tgaactgcag 
gtatgagaaa 
aaaaccagga 
tacaaaattg 
cagacccttc 
tcctcagggg 
cttgattttt 



ggaactgggc 
ggagccaggc 
tggtccaccc 
gaaaccagaa 
tatcgaacag 
tagaccatga 
gaatagttgg 
aactcatgaa 
acctagggct 
caacctcgaa 
ttggagttga 
ggatcacaaa 
gctacctgga 
tgcaggacgt 
acctgaagac 
tcgccctgca 
ggcaggagaa 
aggaaaagag 
tagctttttt 
tctttttcac 
cactcttgct 
ctttcgctgc 
aatgaagaga 
gttgaatttt 
ctaagtggaa 
tgaactctgt 
ttgtgatcca 
aaaaaaaaaa 



ctgaaaggg t 60 

ctggcgagcg 12 0 

cctggtgctcj 180 

gcgtgttgtt 240 

ttttgcagtt 3 00 

ttatttggaa. 3 60 

ctggtaccac 42 0 

aagatactg-t 480 

gcctacagaa 540 

aacatttgaa 6 00 

acacttgtta 6 60 

ccaggtccat 720 

aaaagtcgc c 7 80 

cttcaacctg 840 

caatgaccag 900 

caacctcat c 960 

agaagagag- c 102 0 

tgatgtaaa.^ 1080 

aatttgtaaa 1140 

ttgacatgct 12 00 

gtggtgcta-C 1260 

tgtgagttg-g 132 0 

agtcaacaaa 13 80 

cctcatcttg 1440 

tgcatgaatt 1500 

tgataatacc 1560 

gctccatata 162 0 
aaaaaaaa 1678 



<210> 108 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_002811 



<400> 108 

aaattgctaa gtggaatgca t^aattgcat tatgttctct ggtaacacgt agagttcacja 60 



<210> 109 
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<211> 846 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_002888 



<400> 109 

ccacgtccgg ggtgccg-agc caactttcct gcgtccatgc agccccgccg gcaacggctg 60 

cccgctccct ggtccgg-gcc caggggcccg cgccccaccg ccccgctgct cgcgctgctg 12 0 

ctgttgctcg ccccggfcggc ggcgcccgcg gggtccgggg gccccgacga ccctgggcag 180 

cctcaggatg ctggggtccc gcgcaggctc ctgcagcaga aggcgcgcgc ggcgcttcac 240 

ttcttcaact tccggtccgg ctcgcccagc gcgctgcgag tgctggccga ggtgcaggag 3 00 

ggccgcgcgt ggattaaLtcc aaaagaggga tgtaaagttc acgtggtctt cagcacagag 3 60 

cgctacaacc cagagtcttt acttcaggaa ggtgagggac gtttggggaa atgttctgct 42 0 

cgagtgtttt tcaagaatca gaaacccaga ccaaccatca atgtaacttg tacacggctc 480 

atcgagaaaa agaaaagaca acaagaggat tacctgcttt acaagcaaat gaagcaactg 540 

aaaaacccct tggaaatagt cagcatacct gataatcatg gacatattga tccctctctg 600 

agactcatct gggattfcggc tttccttgga agctcttacg tgatgtggga aatgacaaca 6 60 

caggtgtcac actactactt ggcacagctc actagtgtga ggcagtgggt aagaaaaacc 72 0 

tgaaaattaa cttgtgccac aagagttaca atcaaagtgg tctccttaga ctgaattcat 780 

gtgaacttct aatttcatat caagagttgt aatcacattt atttcaataa atatgtgagt 840 
tcctgc 846 

<210> 110 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM__002 888 
<400> 110 

aaagaaaaga caacaapjagg attacctgct ttacaagcaa atgaagcaac tgaaaaaccc 60 



<210> 111 
<211> 1054 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_003090 



<400> 111 

gaattccgcg ggaggccacg ggctttccac agcgcggggg aacgggaggc tgcaggatgg- 60 

tcaagctgac ggcggag-ctg atcgagcagg cggcgcagta caccaacgcg gtgcgcgacc 120 

gggagctgga cctccgg-ggg tataaaattc ccgtcattga aaatctaggt gctacgttagr 180 

accagtttga tgctatfcgat ttttctgaca atgagatcag gaaactggat ggttttcctt 240 

tgttgagaag actgaaaaca ttgttagtga acaacaacag aatatgccgt ataggtgagg- 3 00 

gacttgatca ggctctg-ccc tgtctgacag aactcattct caccaataat agtctcgtgg - 3 60 

aactgggtga tctggaccct ctggcatctc tcaaatcgct gacttaccta agtatcctaa 420 

gaaatccggt aaccaataag aagcattaca gattgtatgt gatttataaa gttccgcaagr 480 

tcagagtact ggatttcucag aaagtgaaac taaaagagcg tcaggaagca gagaaaatgt 540 

tcaagggcaa acggggtgca cagcttgcaa aggatattgc caggagaagc aaaactttta 600 

atccaggtgc tggtttQ-cca actgacaaaa agagaggtgg gccatctcca ggggatgtagr 660 

aagcaatcaa gaatgccata gcaaatgctt caactctggc tgaagtggag aggctgaagor 720 

ggttgctgca gtctggfccag atccctggca gagaacgcag atcagggccc actgatgatg- 7 80 

gtgaagaaga gatggasigaa gacacagtca caaacgggtc ctgagcagtg aggcagatgt 840 

ataataatag gccctcttgg aacaagtctt gcttttcgaa catggtataa tagccttgtt 900 
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tgtgttagca aagtggaatc tatcagcatt gttgaaatgc ttaagactgc tgctgataat 960 
tttgtaatat aagttttgaa atctaaatgt caattttcta caaattataa aaataaactc 102 0 
cactctctat gctaaaaaaa aaaaaaagga attc 1054 



<210> 112 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003090 



<400> 112 

taatagcctt gtttgtgtta gcaaagtgga atctatcagc attgttgaaa tgcttaagac 60 



<210> 113 

<211> 2033 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJ303158 



<400> 113 

gaattccggg 

tcactccatt 

gcgcggtgct 

taaagaaaac 

tgttctcgtg 

tcagcgggtc 

tgtctccagt 

acctcatcct 

gcacctgaaa 

agaggcagtg 

taatgtttat 

taaagctcag 

gtcccacctt 

agtctaccta 

ttcaaagttt 

ttactgtcat 

atcagctgga 

gaggaccact 

gcatgatgag 

gaagcctcct 

attcacattc 

gcataatccc 

aaattcatca 

atcgtgcagg 

actgaagttt 

tttgttttta 

acatgacact 

ctggaggcaa 

gcacaattct 

gaatagggag 

aggaatctgg 

catctccata 

ttgggataca 

tggagtcata 



actgagctct 
gcccaggcca 
catgcccgta 
tgcatttcag 
actcagcaat 
ttgtgtcctt 
cacaagccgg 
gtctccaggc 
ataatcctga 
gctttggaag 
ttggcaagag 
ctggagaaag 
cggcatccta 
attctggaat 
gatgagcaga 
tcgaagagag 
gagcttaaaa 
ctctgtggca 
aaggtggatc 
tttgaggcaa 
cctgactttg 
agccagaggc 
aaaccatcaa 
gggagaaatc 
attttaccat 
ctcagcaggt 
ctgaagtgaa 
ggttcgactg 
cgtggctact 
gttaaggcac 
tgtctgtccg 
tgagagtgtg 
gaagaggcca 
gcatgtgtgt 



tgaagacttg 
gagtgcggga 
atcccagcat 
gacctgttaa 
ttccttgtca 
caaattcttc 
ttcagaatca 
cactgaataa 
ggaggaactg 
actttgaaat 
aaaagcaaag 
ccggagtgga 
atattcttag 
atgcaccact 
gaactgctaa 
ttattcatag 
ttgcagattt 
ccctggacta 
tctggagcct 
acacatacca 
taacagaggg 
caatgctcag 
attgccaaaa 
cttgagccag 
tgactgctgc 
gtgccttaac 
agtagccacg 
cagtcgaacc 
ttcacttcag 
acctgctgag 
gccccggtag 
aaaataggaa 
tgtgtctcag 
aaactttaaa 



ggtccttggt 
tatttgataa 
tttcggaggc 
ggctacagct 
gaatccatta 
ccagcgcgtt 
gaagcagaag 
cacccaaaag 
gcatcaaaac 
tggtcgccct 
caagtttatt 
gcatcagctc 
actgtatggt 
tggaacagtt 
cttatataac 
agacattaag 
tgggtggtca 
cctgccccct 
tggagttctt 
agagacctac 
agccagggac 
agaagtactt 
caaagaatca 
ggctgccata 
cctcaatcta 
ctccctattc 
agaattgtgc 
ttgcctccag 
agtgtcatgt 
taaaacaaat 
gcctgttggg 
cacgtgctct 
agctgttaag 
tatgcaggcc 



cgcaggtgga 
gaaacttcag 
cgaggcatca 
ccagttggag 
cctgtaaata 
cctttgcaag 
caattgcagg 
agcaagcagc 
agaaaaatga 
ctgggtaaag 
ctggctctta 
agaagagaag 
tatttccatg 
tatagagaac 
agaattgcaa 
ccagagaact 
gtacatgctc 
gaaatgattg 
tgctatgaat 
aaaagaatat 
ctcatttcaa 
gaacacccct 
gctagcaaac 
taacctgaca 
gaacgctaca 
agaaagctcc 
tacttatact 
attatgaacc 
ttattgatgt 
atttcttgtg 
tttctagtcc 
acctccattt 
ggcttatttt 
ttcgtggctc 



gcgacgggtc 

tgaaggccgg 

tggaccgatc 

gtccaaaacg 

gtggccaggc 

cacaaaagct 

caaccagtgt 

ccctgccatc 

agaatcaaaa 

gaaagtttgg- 

aagtgttatt 

tagaaataca 

atgctaccag- 

ttcagaaact 

atgccctgtc 

tacttcttgg- 

catcttccag 

aaggtcggat 

tfcttagttgg- 

cacgggttga. 

gactgttgaa. 

ggatcacagc 

agtcttagga. 

ggaacatgct 

caagaaatat 

acatcaataa 

ggaacataat 

agtataagta 

ggagctttcfc 

tagcgttctfc 

tccttaccat 

agggatttgc 

tttaaaacat 

gag 203 3 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 



<210> 114 
<211> 60 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003158 



<400> 114 

ttgggtttct agtcctcctt 



accatcatct ccatatgaga gtgtgaaaat aggaacacgt 60 



<210> 115 

<211> 1421 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003258 



<400> 115 

acttactgcg 

agctgcatta 

caggtgattc 

cgcttccaga 

agcagcagct 

cgagacgtgg 

ttccctgaca 

gctgcactgg 

ctggccgaga 

tataccaaga 

cactccgtgt 

aacaaagaga 

tttgccccac 

ccgctccctt 

caggaggaag 

tgtgtggctg 

cttccctctc 

ggatctggca 

cctgcttctt 

gccagcacct 

cccaccctcc 

cccttcctac 

ggcaccaacc 

agggtccacg 



ggacggcctt 

acctgcccac 

tcgggccgat 

ttgctcagta 

tctgcacaca 

cccaggaggc 

tcatggagtt 

atgggacctt 

gcgtggtgaa 

ggctcggcac 

gtcggctctg 

actgcccagt 

agcagattct 

cctgccactg 

tcgggaggcg 

ccccacctgc 

agctgctggg 

cactccctct 

cccctctgcg 

ttgagccttg 

cctgaggatg 

ctctggtgat 

ttgctgggac 

cctctgctgt 



ggagagtact 
tgtgctgccc 
gttctcagga 
caagtgcctg 
tgaccggaac 
cctgggcgtg 
ctgcgaggcc 
ccagaggaag 
gctgacggcg 
agagaaggag 
ctacttcaag 
gccaggaaag 
gcaatgcagc 
ccgcctactg 
tggagggtga 
cgcatgctcc 
acgatcgccc 
ccttggggtg 
gctttcactg 
gcccacactg 
gcctggattc 
ggtttccaca 
ttggatccca 
agcttatgaa 



cgggttcgtg 
ggctccccca 
aaaagcacag 
gtgatcaagt 
accatggagg 
gctgtcatag 
atggccaacg 
ccatttgggg 
gtgtgcatgg 
gtcgaggtga 
aaggcctcag 
ccaggggaag 
cctgccaact 
gacgctgccc 
ccacaccttg 
ctcctctcct 
aggctggagc 
agggacagag 
ctgagtttct 
aggcttaggc 
acgccctctt 
ggaacaacag 
ggggcttatc 
attaactaat 



aacttcccgg 
gcaagacccg 
agttgatgag 
atgccaaaga 
cgctgcccgc 
gcatcgacga 
ccgggaagac 
ccatcctgaa 
agtgcttccg 
ttgggggagc 
gccagcctgc 
ccgtggctgc 
gagggacctg 
tgcatgctgc 
gccttctggg 
acccactggt 
tggccccgct 
ccccacgctg 
gttctccctg 
ctctctgcct 
gtttcctttt 
catctttcac 
tcttcaagtg 
t 1421 



aggcgcaatg 6 0 

ggggcagatc 12 0 

acgcgtccgt 180 

cactcgctac 240 

ctgcctgctc 300 

ggggcagttt 3 60 

cgtaattgtg 42 0 

cctggtgccg 480 

ggaagccgoc 540 

agacaagtac 600 

cgggccggac 66 0 

caggaagctc 72 0 

caagggccgc 7 80 

ccagccactc 840 

aactctcctt 900 

ctgcttaaag 9 60 

tggtggccfcg 102 0 

ttgacatcag 108 0 

ggaagcctg-t 1140 

gggatgggct 12 00 

gggctcaaag 12 60 

caagat ggcg-t 132 0 

t ggagaggg-c 13 80 



<210> 116 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_003258 
<400> 116 

cttcctacct ctggtgatgg/ tttccacagg aacaacagca tctttcacca agatgggt^g 



60 



<210> 117 
<211> 913 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> NM_003311 
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<400> 117 

agagccggcg 

aatcccccga 

agctatggaa 

gcccccgcgc 

agcgcacggg 

tccgctgcgc 

agaaccgccg 

ccgccgagga 

ggccatcccc 

gagtcggacc 
cgtgcgcgcg 
ctatgtattt 
tttgcatcac 
gctgtggctc 
accctcatcc 
aaatgaatcg 



ccgtcaccgc 
cgaggtgctei 
gaagaagcgc 
gcgccccaag 
caagtacgtg 
gggcgagago 
cgccctgcag 
cgccgtggct 
gcagcccaaa 

gaggctaggsi 
gatcctcgc t 
atttcgctgg 
tgtgcccatt 
agtgagactc 
agtcctgcgg 
tgt 913 



ccgcattgcc 
cgcgagggcg 
ggggtgctca 
gagctgcgct 
tacttcacca 
tgctggaacg 
gactttcgca 
gccgcggccg 
ccccgcacgc 

cgtggccggc 
ccgttgcacg 
ttctttgtag 
gcaaataaat 
ctgctgggag 
ggctggtgtg 



gctcccagtc 
agttggagaa 
cctccgaccg 
tccactccat 
tcgtcaccac 
cggccatcgc 
gccgccagga 
ccgcaccctc 
catgagcccg 

gctctccagc 
ggcgccttaa 
tcacatattt 
cacttggcca 
ggtggaggcc 
aaaggcgctg 



ccgcgctcgg 
gcgcagcgac 
cctgagcctg 
cctcaaggtg 
cgaccacaag 
gctggcgctc 
acgcaccgca 
cgagccctcg 
ccgcgggcca 

cctgcagcag 
gttattggac 
tatagtctta 
gtttgctttt 
caggaatggg 
ggaaccggct 



cacgacatga 6 0 

agcctcttcc 12 0 

ttccccgcca 180 

gactgcg-tgg 240 

gagatcgact 3 00 

atcgatttcc 3 60 

cccgccgcac 42 0 

gagccctcca 480 

tacgctggac 540 

aagaacttcc 600 

tatctaatat 660 

atatcttgtt 72 0 

ctaccatccg 780 

cgggcaggac 840 

ttgaatcjaat 900 



<210> 118 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NMJD03311 



<400> 118 

atttcgctgg ttctttgtag- tcacatattt tatagtctta atatcttgtt tttgcatcac 



60 



<210> 119 

<211> 1723 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003376 



<400> 119 

tcgcggaggc 

cgctctgtcg 

ctcggtgctg 

catttttttt 

cttgaatcgg 

ggaaaccagc 

gagacggggt 

agtgacctgc 

cgcagctgac 

cacctcctcc 

gagcccgcgc 

ttcgtccaac 

gagccgagcg 

ggggaggagg 

aagccgggct 

gctccccagg 

gaggagagcg 

ggtcgggcct 

ctgctctacc 

cagaatcatc 

atcgagaccc 



ttggggcacjc 
ggaggcgca-g 
gaatttgata. 
taaaactgta 
gccgacggc t 
agaaagagga 
cagagagagc 
ttttggggg-t 
cagtcgcgc t 
ccggccggcg 
ccggaggcg-g 
ttctgggctg 
gagccgcgag 
aagaagagaa 
catggacgg-cj 
ccctggcccg 
ggccgcccca 
ccgaaaccat 
tccaccatcfc 
acgaagtgg- 1 
tggtggaca-t 



cgggtagctc 
cggttaggtg 
ttcattgatc 
ttgtttctcg 
tggggagatt 
aagaggtagc 
gcgcgggcgt 
gaccgccgga 
gacggacaga 
gcggacagtg 
ggtggagggg 
ttctcgcttc 
aagtgctagc 
ggaagaggag 
tgaggcggcg 
ggcctcgggc 
cagcccgagc 
gaactttctg 
caagtggtcc 
gaagttcatg 
cttccaggag 



ggaggtcgtg 
gaccggtcag 
cgggttttat 
ttttaattta 
gctctacttc 
aagagctcca 
gcgagcagcg 
gcgcggcgtg 
cagacagaca 
gacgcggcgg 
gtcggggctc 
ggaggagccg 
tcgggccggg 
agggggccgc 
gtgtgcgcag 
cggggaggaa 
cggagaggga 
ctgtcttggg 
caggctgcac 
gatgtctatc 
taccctgatg 



gcgctggggg 
cggactcacc 
ccctcttctt 
tttttgcttg 
cccaaatcac 
gagagaagtc 
aaagcgacag 
agccctcccc 
ccgcccccag 
cgagccgcgg 
gcggcgtcgc 
tggtccgcgc 
aggagccgca 
agtggcgact 
acagtgctcc 
gagtagctcg 
gcgcgagccg 
tgcattggag 
ccatggcaga 
agcgcagcta 
agatcgagta 



ctagcaccag 60 

ggccagg-gcg 12 0 

ttttcttaaa 180 

ccattcccca 240 

tgtggatttt 3 00 

gaggaag-aga 3 60 

gggcaaagtg 42 0 

cttgggatcc 480 

ccccagctac 540 

gcagggcjccg 600 

actgaaactt 660 

gggggacagcc 720 

gccggacjgag 780 

cggcgctcgg 840 

agccgcgjcgc 9 00 

ccgaggcgcc 960 

cgccggcccc 102 0 

ccttgccttg 1080 

aggagga-ggg 1140 

ctgccatcca 1200 

catcttcaag 12 60 
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ccatcctgtg tgcccctgat. gcgatgcggg ggctgctgca atgacgaggg cctggagtgt 132 0 

gtgcccactg aggagtccaa catcaccatg cagattatgc ggatcaaacc tcaccaaggc 13 80 

cagcacatag gagagatgacj cttcctacag cacaacaaat gtgaatgcag accaaagaaa 1440 

gatagagcaa gacaagaaaa aaaatcagtt cgaggaaagg gaaaggggca aaaacgaaag 150 0 

cgcaagaaat cccggtataa gtcctggagc gttccctgtg ggccttgctc agagcggaga 1560 

aagcatttgt ttgtacaaga. tccgcagacg tgtaaatgtt cctgcaaaaa cacagactcg 162 0 

cgttgcaagg cgaggcagct tgagttaaac gaacgtactt gcagatgtga caagccgagg 1680 

cggtgagccg ggcaggagga. aggagcctcc ctcagggttt egg 1723 



<210> 120 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003376 
<400> 120 

ccagcacata ggagagatget 

<210> 121 

<211> 2834 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003406 



gcttcctaca gcacaacaaa tgtgaatgca gaccaaagaa 60 



<400> 121 

gcccactccc 

aggagataaa 

gecgagcagg 

ggagctgaat 

ggagcccgta 

gagaaaaaac 

atetgeaatg 

gagagcaaag 

gccgctggtg 

tttgaaatca 

aacttctctg 

aagacagctt 

gacagcacgc 

caaggagacg 

tctgcctcat 

gttttttgtt 

catgtggttt 

tgttttcatc 

ttggcatagt 

gtctaagcaa 

ttggttccag 

ctgtggtaga 

tcaatgtgta 

attttactct 

gttagctttc 

taagaacaaa 

atgtatatca 

ttgcttttgt 

aatagttaac 

tttcatgaac 

ataaaagttt 



accgccagct 
aagaacatcc 
ctgagegata 
tatccaatga 
ggtcatcttcj 
agcagatggc 
atgtactgtc 
tcttctattt 
atgacaagaa 
gcaaaaagga 
tgttctattSL 
ttgatgaago 
taataatgea 
aagctgaagc 
tctaaaattt 
tacgatttafc 
ttatgtttaaL 
ttgaggtggo 
acttttggtai. 
agaaaactgc 
tcacaggtgt 
atcatacccc 
cacctttgao 
ggataagggo 
attatttttg- 
tgtaaaagta. 
agcacagcag 
gatttttttt 
agggaaataa 
aatccaagca 
tatgaatgga. 



ggaaccctgg 
agtcatggat 
tgatgacatg 
ggagaggaat 
gagggtegtc 
tcgagaatac 
tcttttggaa 
gaaaatgaaa 
agggattgtc 
aatgeaacca 
tgagattctg 
cattgetgaa 
attactgaga 
aggagaagga 
acacagtaga 
gacaggttta 
tattagggga 
caatatgggg 
cattgtggct 
ctacatactg 
agtaattgtg 
atggatacca 
tacagctgea 
agaaaeggtt 
ctacactcat 
aagatgeagg 
taaaacaaaa 
tttttttttt 
cttgagatga 
taattgttaa 
cttttcaact 



ggactacgac 
aaaaatgagc 
gcagcctgca 
cttctctcag 
tcaagtattg 
agagagaaaa 
aagttcttga 
ggagattact 
gatcagtcac 
acacatccta 
aactccccag 
cttgatacat 
gacaacttga 
ggggaaaatt 
ccatttgtca 
tgttacttct 
gtagagccag 
atgtggaatt 
tcaaaagggc 
gtttgtcctg 
ggtactttaa 
catattaaac 
gaagtgttcc 
cacattccat 
tttatttgta 
aaaaatgaat 
acccatgtat 
gatacttgee 
tggctagctt 
gaacacgtgt 
actttctcta 



gtccctcaaa 
tggttcagaa 
tgaagtctgt 
ttgcttataa 
aacaaaagac 
ttgagacgga 
tccccaatgc 
acegttaett 
aacaagcata 
tcagactggg 
agaaagectg 
taagtgaaga 
cattgtggac 
aaccggcctt 
tecatgetgt 
atttgaattt 
ttaacattta 
tttatacaag 
cagtgtaaaa 
geggggaata 
ggtttggagc 
catgtatatc 
tttagacaaa 
tatttgtaaa 
tttaaatgtt 
tgcttggtat 
ttaacttttt 
taacatgeat 
tgtttaatgt 
attaaattca 
cagcttttca 



ccttgcttct 
ggccaaacutg 
aactgagcaa 
aaatgttgta 
ggaaggtg-ct 
gctaagagat 
ttcacaagca 
ggctgagcjtt 
ccaagaagct 
tctggccctt 
ctctcttQca 
gtcatacaaa 
ateggatace 
ccaacttttg 
cccacaaata 
ctatatttcc 
gggagtt&tc 

ttataagtgt 
ctgcttcc:at 
aaagggatca 
acttacaagg 
tgtggaafcac 
gttgtgaccc 
gttacctg-ct 
ttaggcaa.ee 
teattaefcte 
tttaggattt 
gtgctgtaaa 
cttatgaaat 
tgtaagtcjga 
tgtaaatfciag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 
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tcttggttct gaaacttctc taaaggaaat tgtacatttt ttgaaattta ttccttattc 192 0 

cctcttggca gctaatgcj-g-c tcttaccaag tttaaacaca aaatttatca taacaaaaat 19 80 

actactaata taactactgt ttccatgtcc catgatcccc tctcttcctc cccaccctga 2040 

aaaaaatgag ttcctatttt ttctgggaga gggggggatt gattagaaaa aaatcjtagtg 2100 

tgttccattt aaaattttgg catatggcat tttctaactt aggaagccac aatgttcttg 2160 

gcccatcatg acattggcj-ta gcattaactg taagttttgt gcttccaaat cactttttgg 2220 

tttttaagaa tttcttgata ctcttatagc ctgccttcaa ttttgatcct ttattctttc 22 80 

tatttgtcag gtgcacaaga ttaccttcct gttttagcct tctgtcttgt caccaaccat 23 40 

tcttacttgg tggccatgta cttggaaaaa ggccgcatga tctttctggc tccactcagt 2400 

gtctaaggca ccctgcttcc tttgcttgca tcccacagac tatttccctc atcctattta 2460 

ctgcagcaaa tctctcctta gttgatgaga ctgtgtttat ctccctttaa aaccctacct 2520 

atcctgaatg gtctgtca_tt gtctgccttt aaaatccttc ctctttcttc ctcctctatt 2580 

ctctaaataa tgatgggcjct aagttatacc caaagctcac tttacaaaat atttcctcag 2640 

tactttgcag aaaacaccaa acaaaaatgc cattttaaaa aaggtgtatt ttttotttta 27 00 

gaatgtaagc tcctcaagag cagggacaat gttttctgta tgttctattg tgcctagtac 27 60 

actgtaaatg ctcaataaat attgatgatg ggaggcagtg agtcttgatg ataag-ggtga 2 82 0 
gaaactgaaa tccc 2 8 34 

<210> 122 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003406 
<400> 122 

tttagccttc tgtcttgtca ccaaccattc ttacttggtg gccatgtact tggaaaaagg 6 0 

<210> 123 

<211> 1938 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_003504 
<400> 123 

gatttggcgg gagtcttgac cgccgccggg ctcttggtac ctcagcgcga gcgccaggcg 60 

tccggccgcc gtggctatgt tcgtgtccga tttccgcaaa gagttctacg aggtggtcca 12 0 

gagccagagg gtccttctct tcgtggcctc ggacgtggat gctctgtgtg cgtgcaagat 180 

ccttcaggcc ttgttccagt gtgaccacgt gcaatatacg ctggttccag tttc-fcgggtg 2 40 

gcaagaactt gaaactgcat ttcttgagca taaagaacag tttcattatt ttattctcat 3 00 

aaactgtgga gctaatgtag acctattgga tattcttcaa cctgatgaag acacizatatt 3 60 

ctttgtgtgt gacacccata ggccagtcaa tgtcgtcaat gtatacaacg atacccagat 42 0 

caaattactc attaaacaag atgatgacct tgaagttccc gcctatgaag acatcttcag 480 

ggatgaagag gaggatgaag agcattcagg aaatgacagt gatgggtcag agccttctga 540 

gaagcgcaca cggttag&ag aggagatagt ggagcaaacc atgcggagga ggcagcggcg 600 

agagtgggag gcccggagaa gagacatcct ctttgactac gagcagtatg aatatcatgg 6 60 

gacatcgtca gccatggtga tgtttgagct ggcttggatg ctgtccaagg acctgaatga 720 

catgctgtgg tgggccatcg ttggactaac agaccagtgg gtgcaagaca agatcactca 7 80 

aatgaaatac gtgactgeitg ttggtgtcct gcagcgccac gtttcccgcc acaaccaccg 840 

gaacgaggat gaggagaaca cactctccgt ggactgcaca cggatctcct ttga^tatga 9 00 

cctccgcctg gtgctctacc agcactggtc cctccatgac agcctgtgca acaccagcta 9 60 

taccgcagcc aggttcaeigc tgtggtctgt gcatggacag aagcggctcc aggagttcct 102 0 

tgcagacatg ggtcttcccc tgaagcaggt gaagcagaag ttccaggcca tggacatctc 1080 

cttgaaggag aatttgcg-gg aaatgattga agagtctgca aataaatttg ggat^aagga 1140 

catgcgcgtg cagactttca gcattcattt tgggttcaag cacaagtttc tggccagcga 12 00 

cgtggtcttt gccaccatgt ctttgatgga gagccccgag aaggatggct cagggacaga 12 60 

tcacttcatc caggctc-fcgg acagcctctc caggagtaac ctggacaagc tgtaccatgg 132 0 
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cctggaactc gccaagaagc agctgcgagc 
caccaacctc gtcatctccc aggggccttt 
agatgtcatg ctgttctcta. ggccggcatc 
gtcctttgtg tgttcgacaa agaaccggcg 
ccccctgagc atggagcatgj gcacagtgac 
ctcggacagg aagaactttt ttgggagggc 
ccggatgctg cacaaccatt ttgacctctc 
caagtttctg gacgcactta tttccctcct 
cttcttattt atgtaactgcj ctttcattta 
tagaagccat tttttattaa. ataaaatgct 
aaaaaaaaaa aaaaaaaa 193 8 

<210> 124 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_003504 
<400> 124 

caagtttctg gacgcactta. tttccctcct gtcctaggaa tttgattctt ccagaatgac 6 0 

<210> 125 

<211> 2346 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM„0036OO 
<400> 125 

acaaggcagc ctcgctcgacj cgcaggccaa tcggctttct agctagaggg tttaactcct 60 

atttaaaaag aagaaccttt gaattctaac ggctgagctc ttggaagact tgggbccttg 12 0 

ggtcgcaggt gggagccgac gggtgggtag accgtggggg atatctcagt ggcgg-acgag 180 

gacggcgggg acaaggggcgj gctggtcgga gtggcggagc gtcaagtccc ctgtcggttc 240 

ctccgtccct gagtgtcctfc ggcgctgcct tgtgcccgcc cagcgccttt gcatccgctc 3 00 

ctgggcaccg aggcgccctg- taggatactg cttgttactt attacagcta gaggcatcat 3 60 

ggaccgatct aaagaaaact gcatttcagg acctgttaag gctacagctc cagttggagg 42 0 

tccaaaacgt gttctcgtga ctcagcaatt tccttgtcag aatccattac ctgtaaatag 480 

tggccaggct cagcgggtct tgtgtccttc aaattcttcc cagcgcattc ctttgrcaagc 540 

acaaaagctt gtctccagtc acaagccggt tcagaatcag aagcagaagc aattgrcaggc 6 00 

aaccagtgta cctcatcctcj tctccaggcc actgaataac acccaaaaga gcaacj-cagcc 6 60 

cctgccatcg gcacctgaaa ataatcctga ggaggaactg gcatcaaaac agaaaaatga 72 0 

agaatcaaaa aagaggcagt gggctttgga agactttgaa attggtcgcc ctctcjggtaa 780 

aggaaagttt ggtaatgttt atttggcaag agaaaagcaa agcaagttta ttctg-gctct 840 

taaagtgtta tttaaagctc agctggagaa agccggagtg gagcatcagc tcagaagaga 900 

agtagaaata cagtcccacc ttcggcatcc taatattctt agactgtatg gttatttcca 9 60 

tgatgctacc agagtctacc taattctgga atatgcacca cttggaacag tttatagaga 102 0 

acttcagaaa ctttcaaagt ttgatgagca gagaactgct acttatataa cagaattggc 1080 

aaatgccctg tcttactgtc attcgaagag agttattcat agagacatta agccagagaa 1140 

cttacttctt ggatcagct^ gagagcttaa aattgcagat tttgggtggt cagtacatgc 12 0 0 

tccatcttcc aggaggacca ctctctgtgg caccctggac tacctgcccc ctgaa_atgat 12 6 0 

tgaaggtcgg atgcatgatg- agaaggtgga tctctggagc cttggagttc tttgctatga 132 0 

atttttagtt gggaagcctc cttttgaggc aaacacatac caagagacct acaaaagaat 13 80 

atcacgggtt gaattcacat tccctgactt tgtaacagag ggagccaggg acctcatttc 1440 

aagactgttg aagcataatc ccagccagag gccaatgctc agagaagtac ttgaacaccc 1500 

ctggatcaca gcaaattcat caaaaccatc aaattgccaa aacaaagaat cagctagcaa 156 0 

acagtcttag gaatcgtgca gggggagaaa tccttgagcc agggctgcca tataacctga 162 0 

caggaacatg ctactgaagt ttattttacc attgactgct gccctcaatc tagaacgcta 1680 

cacaagaaat atttgtttta. ctcagcaggt gtgccttaac ctccctattc agaaagctcc 1740 



cacccagcag accattgcca gctgcctttg 13 80 

cctgtactgc tctctcatgg aggcjcactcc 1440 

cctaagcctg ctcagcaaac acctgctcaa 15 00 

ctgcaaactg ctgcccctgg tgatgcjctgc 15 6 0 

cgtggtgggc atccccccag agaccgacag 162 0 

gtttgagaag gcagcggaaa gcaccagctc 1680 

agtaattgag ctgaaagctg aggatcggag 1740 

gtcctaggaa tttgattctt ccagaeitgac 1800 

gattgtaagt tatggacatg atttgagatg 1860 

tattttaggc tccgtcccca aaaaaaaaaa 192 0 
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acatcaataa acatgacact ctgaagtgaa agtagccacg agaattgtgc tacttatact 1800 

ggttcataat ctggaggcaa ggttcgactg cagccgcccc gtcagcctgt gcta_g-gcatg 1860 

gtgtcttcac aggaggcaaa tccagagcct ggctgtgggg aaagtgacca ctctgccctg 192 0 

accccgatca gttaaggagc tgtgcaataa ccttcctagt acctgagtga gtgtg-taact 1980 

tattgggttg gcgaagcctg gtaaagctgt tggaatgagt atgtgattct ttttaagtat 2 040 

gaaaataaag atatatgtac agacttgtat tttttctctg gtggcattcc tttag-gaatg 2100 

ctgtgtgtct gtccggcacc ccggtaggcc tgattgggtt tctagtcctc cttaaccact 2160 

tatctcccat atgagagtgt gaaaaatagg aacacgtgct ctacctccat ttag-ggattt 222 0 

gcttgggata cagaagaggc catgtgtctc agagctgtta agggcttatt tttttaaaac 22 80 

attggagtca tatgcatgtgt gtaaacttta aatatgcaaa taaataagta tctatgtcta 2340 
aaaaaa 2346 

<210> 126 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0036 00 
<400> 126 

agagtgtgaa aaataggaac acgtgctcta cctccattta gggatttgct tgggatacag 60 

<210> 127 
<211> 853 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_003 6 41 
<400> 127 

ctagtcctga cttcacttct gatgaggaag cctctctcct tagccttcag cctttcctcc 6 0 

caccctgcca taagtaattt gatcctcaag aagttaaacc acacctcatt ggtccctggc 12 0 

taattcacca atttacaaac agcaggaaat agaaacttaa gagaaataca cacttctgag 180 

aaactgaaac gacaggggaa aggaggtctc actgagcacc gtcccagcat ccggacacca 240 

cagcggccct tcgctccacg cagaaaacca cacttctcaa accttcactc aacacttcct 3 00 

tccccaaagc cagaagatgc acaaggagga acatgaggtg gctgtgctgg gggcaccccc 3 60 

cagcaccatc cttccaaggt ccaccgtgat caacatccac agcgagacct ccgtgcccga 42 0 

ccatgtcgtc tg^tccctgt tcaacaccct cttcttgaac tggtgctgtc tgggcttcat 480 

agcattcgcc tactccgtga agtctaggga caggaagatg gttggcgacg tgaccggggc 540 

ccaggcctat gcctccaccg ccaagtgcct gaacatctgg gccctgattc tgggcatcct 600 

catgaccatt ggattcatcc tgtcactggt attcggctct gtgacagtct accatattat 660 

gttacagata atacaggaaa aacggggtta ctagtagccg cccatagcct gcaacctttg 720 

cactccactg tgcaatgctg gccctgcacg ctggggctgt tgcccctgcc cccttggtcc 780 

tgcccctaga tacagcagtt tatacccaca cacctgtcta cagtgtcatt caataaagtg 840 
cacgtgcttg tga 853 

<210> 128 
<211> 60 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_003641 
<400> 128 

attatgttac agataataca ggaaaaacgg ggttactagt agccgcccat agcctgcaac 60 
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<210> 129 
<211> 1280 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_003756 



<400> 129 

gaaagatggc 

ccggcgcagc 

tgcagataga 

gaactgaagt 

ccaactgctt 

atcagatgga 

ggtatcagtc 

gttaccagca 



gtcccgcaag 
agggaaaggc 
tggccttgtg 
tgttcaagga 
tcctttccct 
aatga-tgcgg 
cacatactat 
tgccattgaa 



gaaggtaccg 
aaaggcaaag 
gtattaaaga 
gtgcttttgg 
cagcacacag 
agccttcgcc 
ggctcattcg 
gaatctgtcg 



gctctactgc 
gcggctcggg 
taatcaaaca 
gtctggttgt 
aggatgatgc 
atgtaaacat 
ttacccgggc 
ttctcattta 



cacctcttcc 
agattcagcc 
ttatcaagaa 
agaagatcgg 
tgactttgat 
tgatcatctt 
actcctggac 
tgatcccata 



age tccaccg 
gtgaagcaag 
gaaggacaag 
cttgaaatta 
gaagtccaat 
cac^tgggct 
tctcagttta 
aaaactgccc 



60 

120 

180 

240 

300 

360 

420 

480 



aaggatctct 
aaaaggattt 
aagaagtgcc 
aaaagaagtc 
tggggaagaa 
aatacaacac 
agcgtcgcca 
aggacctgtc 
ttgeaggeca 
gcaagctctt 
gaaaagaagt 
ttgcatattg 
tgtctttcta 
aaaaaaaaaa 



ctcac taaag 
ttcccctgaa 
gattcjtaatt 
agctgttgca 
tctacagttg 
atacatgagg 
gcagg-agaat 
caaactcttc 
gataaacact 
catggcccag 
taacatgaac 
aaaagcacag 
gecataataa 
aaaaaaaaaa 



gcatacagac 
gcattgaaaa 
aaaaattcac 
gataaacatg 
ctgatggaca 
aatactagta 
atgcagcgcc 
aaaccaccac 
tactgecaga 
gctcttcaag 
tcttgaagtc 
aggatttctt 
aataaaaaaa 
1280 



tgactcctaa 
aagcaaatat 
atctgatcaa 
aattgctcag 
gagtggatga 
aacaacagca 
agagecgagg 
agccgcctgc 
acatcaagga 
aatacaacaa 
acaccagggc 
tagtgtcatt 
aaaaaaaaaa 



actgatggaa 
cacctttgag 
tgtcctaatg 
ccttgccagc 
aatgagccaa 
gcagaaacat 
agaacccccg 
caggatggac 
gttcactgcc 
ctaagaaaag 
aactcttgga 
gecgattttg 
aaaaaaaaaa 



gtt tgtaaag 
tac3.tgtttg 
tgg^aacttg 
agcaatcatt 
gatatagtta 
cagtatcagc 
ctccctgagg 
tcgctgctca 
caaaacttag 
gaagtttcca 
agaaatatat 
gctataacag 
aaaaaaaaaa 



540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 



<210> 130 
<211> 60 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_003756 
<400> 130 

tgagecaaga tatagttaaa tacaacacat acatgaggaa tactagtaaa caacagcagc 60 

<210> 131 
<211> 839 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_003 83 2 
<400> 131 

aagecacagg ctccctggct 
gcctggccgc cgccgcctat 
aagtggcggg cacgatcgca 
gccccggcgg cgcactcgcc 
cccactcaga gctgaggaag 
geaeggtcat cagtgaagaa 
aggcaacaag tcaaggataa 



ggegtcaget aaagtggctg ttgggtgtcc gcaggcttct 60 

aagctaccag gaggagcttt acgacttccc gtcctgcggg 12 0 

aggtagegea gaagcttctc aatggccagc gccagctgca 180 

tcacctgagc ctgggaggaa aattcttcca agga.tgatct 240 

cttttctact cagcagatgc tgtgtgtttt gat^ttgaca 3 00 

ggaateggat gctttcattg gatttggagg aaatgtgatc 3 60 

cgccaaatgg tatatcactg attttgtaga gctg-ctggga 42 0 
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gaaccggaag 
aagttacaca 
tggtacagat 
gtttgttttt 
gtaatctgaa 
gagagtcaat 
tggttttttt 



aataacatcc 
gattgatact 
gatctgcact 
aactgtagtt 
ttctttatgt 
taaacacctt 
ccctttggta 



attgtcatac 
gtttgcttac 
gtcaagtaaa 
ccagtattat 
atattcctag 
tactcttagg 
tgtcaataaa 



agctccaaac 
aattgcctat 
ctacagttag 
atgatcacta 
ctatatttca 
aatatagatt 
agtttatcca 



aacttcagat 
tacaacttgc 
gaatcctcaa 
tcgatttcct 
tacaaagtgt 
cggcagcctt 
tgtgtcagaa 



gaatttttac 
tataaaaagt 
agsLttggttt 
gga_gagtttt 
tttaagagtg 
cagtgaatat 
aaaaaaaaa 



480 
540 
600 
660 
720 
780 
839 



<210> 132 
<211> 60 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_003 8 32 



<400> 132 

gaagaaggaa tcggatgctt tcattggatt tggaggaaat gtgatcaggc aacaagtcaa 6 0 

<210> 133 
<211> 3128 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_003 9 81 
<400> 133 

gcttcgcccc gtggcgcggt ttgaaatttt gcggggctca acggctcgcg gagcggctac 6 0 

gcggagtgac atcgccggtg tttgcgggtg gttgttgctc tcggggccgt gtg-^agtagg 12 0 

tctggacctg gactcacggc tgcttggagc gtccgccatg aggagaagtg agg-tgctggc 180 

ggaggagtcc atagtatgtc tgcagaaagc cctaaatcac cttcgggaaa tatgggagct 240 

aattgggatt ccagaggacc agcggttaca aagaactgag gtggtaaaga agcatatcaa 3 00 

ggaactcctg gatatgatga ttgctgaaga ggaaagcctg aaggaaagac tcatcaaaag 3 60 

catatccgtc tgtcagaaag agctgaacac tctgtgcagc gagttacatg ttg-a.gccatt 42 0 

tcaggaagaa ggagagacga ccatcttgca actagaaaaa gatttgcgca cccaagtgga 480 

attgatgcga aaacagaaaa aggagagaaa acaggaactg aagctacttc aagagcaaga 540 

tcaagaactg tgcgaaattc tttgtatgcc ccactatgat attgacagtg cctcagtgcc 600 

cagcttagaa gagctgaacc agttcaggca acatgtgaca actttgaggg aaacaaaggc 66 0 

ttctaggcgt ga_g-gagtttg tcagtataaa gagacagatc atactgtgta tggaagaatt 72 0 

agaccacacc ccagacacaa gctttgaaag agatgtggtg tgtgaagacg aagatgcctt 7 80 

ttgtttgtct ttggagaata ttgcaacact acaaaagttg ctacggcagc tggaaatgca 840 

gaaatcacaa aatgaagcag tgtgtgaggg gctgcgtact caaatccgag agctctggga 900 

caggttgcaa atacctgaag aagaaagaga agctgtggcc accattatgt ctgggtcaaa 9 60 

ggccaaggtc cggaaagcgc tgcaattaga agtggatcgg ttggaagaac tgaaaatgca 102 0 

aaacatgaag aaagtgattg aggcaattcg agtggagctg gttcagtact ggg-<accagtg 1080 

cttttatagc caggagcaga gacaagcttt tgcccctttc tgtgctgagg actacacaga 1140 

aagtctgctc cagctccacg atgctgagat tgtgcggtta aaaaactact atgaagttca 12 00 

caaggaactc tttgaaggtg tccagaagtg ggaagaaacc tggaggcttt tcttagagtt 12 60 

tgagagaaaa gcttcagatc caaatcgatt tacaaaccga ggaggaaatc ttctaaaaga 132 0 

agaaaaacaa cgagccaagc tccagaaaat gctgcccaag ctggaagaag agttzgaaggc 13 80 

acgaattgaa ttgtgggaac aggaacattc aaaggcattt atggtgaatg ggcagaaatt 1440 

catggagtat gtggcagaac aatgggagat gcatcgattg gagaaagaga gagccaagca 1500 

ggaaagacaa ctgaagaaca aaaaacagac agagacagag atgctgtatg gcagcgctcc 1560 

tcgaacacct agcaagcggc gaggactggc tcccaataca ccgggcaaag cacgtaagct 162 0 

gaacactacc accatgtcca atgctacggc caatagtagc attcggccta tctttggagg 1680 

gacagtctac cactcccccg tgtctcgact tcctccttct ggcagcaagc cagtcgctgc 1740 

ttccacctgt tcagggaaga aaacaccccg tactggcagg catggagcca acaaggagaa 1800 

cctggagctc aacggcagca tcctgagtgg tgggtaccct ggctcggccc ccctccagcg 1860 

caacttcagc attaattctg ttgccagcac ctattctgag tttgcgaagg atccgtccct 192 0 
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ctctgacagt tccactgttg ggcttcagcg agaactttca aaggcttcca aatctgatgc 1980 

tacttctgga atcctcaatt caaccaacat ccagtcctga gaagccctga tcagtcaacc 2 040 

agctgtggct tcctgtgcct agactggacc taattatatg ggggtgactt ta.gtttttct 2100 

tcagcttagg cgtgcttgaa accttggcca ggttccatga ccatgggcct aacttaaaga 216 0 



tgtgaatgag 
tttgacttat 
gacttctcta 
actgtaggaa 
aactagtaat 
ctttgctctt 
tgtctttgtg 
ctgggagcag 
ggggtatgca 
gctctcatct 
cagccacatt 
gaactttgca 
cctctttacc 
tgtctgcctg 
tgtggacttg 
tacaattatt 
aaaaaaaa 3 



tgttacagtt 
atacatgaaa 
ccacataatt 
aatgattgac 
taacatcttt 
acctgaagcc 
tgtccgaagt 
ctttgttaaa 
tgtatgaaca 
cccccattcc 
gggag-tctgt 
catgtcacta 
tatcctctca 
cagcagtact 
gtgttagcca 
cttaaaaata 
128 



gaaagcccat 
agtttatggc 
ctacttgctg 
tatgcccatc 
tttgtttagg 
ttgtccccac 
tgagatggcc 
ttgtgttcag 
cacatgcttg 
tgtgcctact 
ttgttccaat 
ctggggaggt 
atcactactc 
actgtcaaca 
cgctgtttac 
aattaaaatc 



cataggttta 
aagaagtgca 
aagcatgatc 
cctgggggta 
catgttcaat 
cacacaggac 
tgccctactg 
ttctgttaca 
tcggaacgct 
ttgcctgagt 
gggttgagct 
gttcctgctc 
ttcttgaagc 
tagtgtaaat 
tcatacagta 
tgtatactta 



gtggtcctag 
aattttagca 
aaagcttgtt 
attttggcat 
taatgctgta 
agccttcctc 
ccaaagaggt 
cagtgcattg 
ttctcggcgt 
tcttctaccc 
gtctttgtcg 
tagcttccac 
actattattt 
ggttctcaaa 
cgtgtcctgt 
catttcaaaa 



ga-g-acttggt 222 0 

ta. tggggcct 22 80 

ttatttcacc 2340 

gtatacctgt 2400 

gctatcatag 2460 

c t gaagagaa 252 0 

gacaggaagg 2580 

ccctttgttg 2640 

ttgtcccttg 2700 

ccgcagttgc 2760 

tg-^agatctg 2 82 0 

ga. tgaggcgc 2 880 

attcttccgc 2940 

agcttaccag 3000 

ttttaaaata 3 0 60 

agaaaaaaaa 312 0 



<210> 134 
<211> 60 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_003981 



<400> 134 

tgcagcagta ctactgtcaa catagtgtaa atggttctca aaagcttacc ag-tgtggact 6 0 

<210> 135 
<211> 1816 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_004029 



<400> 135 

ggcacccagg gtccg-gcctg cgccttcccg ccaggcctgg acactggttc aacacctgtg 60 

acttcatgtg tgcgcgccgg ccacacctgc agtcacacct gtagccccct ctgccaagag 12 0 

atccataccg aggcagcgtc ggtggctaca agccctcagt ccacacctgt ggacacctgt 180 

gacacctggc cacacgacct gtggccgcgg cctggcgtct gctgcgacag gagcccttac 240 

ctcccctgtt ataacacctg accgccacct aactgcccct gcagaaggag ca^itggcctt 300 

ggctcctgag agggcagccc cacgcgtgct gttcggagag tggctccttg gagagatcag 3 60 

cagcggctgc tatga.ggggc tgcagtggct ggacgaggcc cgcacctgtt tccgcgtgcc 420 

ctggaagcac ttcgcgcgca aggacctgag cgaggccgac gcgcgcatct tcaaggcctg 480 

ggctgtggcc cgcgg-caggt ggccgcctag cagcagggga ggtggcccgc cccccgaggc 540 

tgagactgcg gagcgcgccg gctggaaaac caacttccgc tgcgcactgc gcagcacgcg 600 

tcgcttcgtg atgctgcggg ataactcggg ggacccggcc gacccgcaca aggtgtacgc 660 

gctcagccgg gagctgtgct ggcgagaagg cccaggcacg gaccagactg aggcagaggc 72 0 

ccccgcagct gtcccaccac cacagggtgg gcccccaggg ccattcttgg cacacacaca 7 80 

tgctggactc caagccccag gccccctccc tgccccagct ggtgacaagg gggacctcct 840 

gctccaggca gtgcaacaga gctgcctggc agaccatctg ctgacagcgt ca-fcggggggc 900 

agatccagtc ccaaccaagg ctcctggaga gggacaagaa gggcttcccc tga.ctggggc 960 

ctgtgctgga ggcgaggccg cggccccaga gtccccgcac caggcagagc cgtacctgtc 102 0 

accctcccca agcgcctgca ccgcggtgca agagcccagc ccaggggcgc tg^acgtgac 1080 
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catcatgtac aagggccgca cggtgctgca gaaggtggtg ggacacccga 
cctatacggc cccccagacc cagctgtccg ggccacagac ccccagcagg 
cagccctgcc gagctcccgg accagaagca gctgcgctac acggaggaac 
cgtggcccct gggttgcacc tggagcttcg ggggccacag ctgtgggccc 
caagtgcaag gtgtactggg aggtgggcgg acccccaggc tccgccagcc 
agcctgcctg ctgcctcgga actgtgacac ccccatcttc gacttcagag 
agagctggtg gaattccggg cacggcagcg ccgtggctcc ccacgctata 
gggcttcggg caggacctgt cagctgggag gcccaaggag aagagcctgg 
gctggaaccc tggctgtgcc gagtgcacct agagggcacg cagcgtgagg 
cctggatagc agcagcctca gcctctgcct gtccagcgcc aacagcctct 
cgagtgcttc cttatggagc tggagcagcc cgcctagaac ccagtctaat 
gaaagctgga gcagcccacc tagagctggc cgcggccgcc cagtctaata 
cagaacaaaa aaaaaa 1816 



gctgcacgtt 1140 

tagcattccc 12 00 

tcjctgcggca 12 60 

gcjcgcatggg 132 0 

cctccacccc 1380 

tcttcttcca 1440 

ccatctacct 1500 

tcctggtgaa 15 6 0 

gfcgtgtcttc 162 0 

atgacgacat 1680 

gagaactcca 1740 

aaaagaactc 1800 



<210> 136 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004029 



<400> 136 

agcagcccac ctagagctgg ccgcggccgc ccagtctaat aaaaagaact ccagaacaaa 
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<210> 137 
<211> 2121 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004203 



<400> 137 

tggaattttt 

cctccgcccg 

ggtcccgtgg 

cgccccaggg 

cttaccgtct 

gagacggccc 

ctcagggaca 

gaggctgttg 

ccacggaggg 

gccacgcaga 

ctccgccccc 

gctggcacca 

agagccctgg 

tcagccgcct 

gccggctcta 

gcaagttggc 

tggagcaggc 

gcctgcagca 
acctgcggga 
atgtcaagcc 
gactgctggt 
acatggcccc 
gcctcaccat 
agcagctgcg 
gttctgtcct 
tgctggcact 



ggcgcgagca 
ggcgtccgga 
cctccgcgga 
cttccccaca 
accgggaaag 
gatcgcccag 
ccctggattc 
ggccaggcca 
caccccgcca 
acctggattc 
tgccaagggc 
gctgcagccc 
gtatgaccca 
gggc catggc 
tgcggtaaag 
cgag*^tgggc 
ctgg-g-aggag 

acactgtgag 
cacgctgctt 
tgccaacatc 
ggag-ctgggt 
cgagctgctg 
cctg-g-aagtg 
ccagrg-gctac 
tgtcatgatg 
gcctgtgttg 



gctccgcgcg 
acagtcgacg 
cccggaatct 
cccacggagt 
gtggccagca 
ggccccggaa 
acgtgggagc 
gtcatgctag 
cctctgagtg 
tccctcaaga 
agcattccca 
cggcgggtgt 
agccggccag 
tcctacggag 
cgttccatgt 
agccacgaga 
ggcggcatcc 

gcctggggtg 
gccctggccc 
ttcctggggc 
acagcaggag 
cagggctcct 
gcatgcaaca 
ctgccccctg 
ctggagccag 
aggcagccgc 



cgttcacggg 
gcagactccg 
gggccctcgc 
gaagtcagcc 
gatgtgtcgg 
gctgcggagg 
ccctgctcct 
aacggcctcc 
gcacccccat 
ggcccagggg 
tcagccgcct 
cattccgggg 
agtccttctt 
aggtcttcaa 
caccattccg 
aggtggggca 
tgtacctgca 

ccagcctgcc 
atctgcacag 
cccggggccg 
ctggtgaggt 
atgggacagc 
tggagctgcc 
agttcactgc 
accccaagct 
gggcctgggg 



ccgttccccc 
gcccgctgag 
ggacccgcgc 
gcggccctgc 
gcctggtgag 
tcacccccgc 
gcctcccccg 
tgcactggcc 
cccagtccca 
gctcagccgg 
cttccctcct 
cgaggcctca 
ccagcagagc 
ggtgcgctcc 
gggccccaag 
gcacccatgc 
gacggagctg 

tgaggcccag 
ccagggcctg 
ctgcaagctg 
ccaggaggga 
agcggatgtg 
ccacggtggg 
cggtctgtct 
gcgggccacg 
tgtgctgtgg 



tcacgggagt 60 

ccacccgagg 12 0 

cccgcccagt 180 

ctgggaggaa 240 

ag-ggtgaggc 3 00 

ctggccttag 3 60 

tcccaccact 420 

atgcccatgc 480 

gcctacttcc 540 

agcctcccac 600 

ccfgaccccag 660 

gagactctgc 72 0 

ttzccagaggc 7 80 

aaggaggacg 840 

gaccgggccc 900 

t^cgtgcggc 9 60 

tcjcgggccca 102 0 

gtctggggct 1080 

gtgcaccttg 1140 

g^tgacttcg 12 00 

gacccccgct 1260 

ttcagtctgg 1320 

gagggc t ggc 13 80 

tccgagctgc 1440 

gccgaggccc 15 00 

tcjcatggcag 15 60 
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cggaggcccfc gagccgaggg tgggccctgt ggcaggccct gcttgccctg ctctgctggc 162 0 

tctggcatgg- gctggctcac cctgccagct ggctacagcc cctgggcccg ccagccaccc 1680 

cgcctggcto accaccctgc agtttgctcc tggacagcag cctctccagc aactgggatg 1740 

acgacagcct agggccttca ctctcccctg aggctgtcct ggcccggact g-tggggagca 1800 

cctccacccc ccggagcagg tgcacaccca gggatgccct ggacctaagt gacatcaact 186 0 

cagagcctcc tcggggctcc ttcccctcct ttgagcctcg gaacctcctc sigcctgtttg 192 0 

aggacaccct agacccaacc tgagccccag actctgcctc tgcactttta eiccttttatc 19 80 

ctgtgtctct cccgtcgccc ttgaaagctg gggcccctcg ggaactccca tggtcttctc 2 040 

tgcctggccg- tgtctaataa aaagtatttg aaccttggga gcacccaagc ttgctcatgt 210 0 

ggcaaaaaaa aaaaaaaaaa a 2121 



<210> 138 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_O04203 



<400> 138 

ctggccgtgt ctaataaaaa gtatttgaac cttgggagca cccaagcttg ctcatgtggc 



60 



<210> 139 
<211> 1982 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_O04207 



<400> 139 

ggcgagaggo 

ccatgggagQ- 

ggggctgggo 

aggccgtcacg- 

cagcctggat 

tgtgcgtgaei 

tgggcatggt 

tcatcacggcj 

acttcagcaa 

tgtgtgccct 

tcctcatcct 

tggtggtcac 

tgagcgtctt 

tggggctctt 

acaccaaggc 

ccgcgggct t 

tctccatgtt 

gcctcgtggt 

tcgaggtgc t 

tgctgatggsi 

cgacccacgt 

tgattttgct 

aggtggcggo 

tgcgggaggfc 

ccccggaaac 

aagccggcaa 

gggctccago 

cggggtgggsi 

aaggcatcct 

ctcaaggacc 



gggctgaggc 
ggccgtggtg 
cgtgctcttc 
tgtcttcttc 
ctcctccatc 
ccgctttggc 
ggctgcgtcc 
gttgggtttg 
gcggcgcccc 
gagcccgctg 
gggcggcctg 
ggcccagccg 
ccgggaccgc 
cgtcccgccc 
cgccttcctg 
cgtggcgggg 
cttcaacggc 
cttctgcatc 
catggccatc 
ggcggtggcc 
ctacatgtac 
gctgggcaac 
cgcggaggag 
ggagcatttc 
aagtgtctga 
cgcttgctat 
tgccggccca 
accgtgtcat 
caccaggggc 
tggaaaccca 



ggcccagcgg 
gacgagggcc 
ggctgtttcg 
aaggagctca 
ctgctggcca 
tgccggcccg 
ttttgccgga 
gcactcaact 
atggccaacg 
gggcagctgc 
ctgctcaact 
ggctcggggc 
ggctttgtgc 
gtgttcgtgg 
ctcaccatcc 
cttgggaagg 
ctcgcggacc 
ttctttggca 
gtgggcaccc 
gtgctcgtcg 
gtgttcatcc 
ttcttctgca 
gagaagctcc 
ctgaaggctg 
gtggctgggc 
ttattttaca 
gcggatcgtc 
tccagagtgg 
cccgcctgct 
tgcttcgaga 



cggcaggtga 
ccacaggcgt 
tcatcactgg 
tacaggagtt 
tgctctacgg 
tcatgcttgt 
gcatcatcca 
tccagccctc 
ggctggcggc 
tgcaggaccg 
gctgcgtgtg 
cgccgcgacc 
tttacgccgt 
tgagctacgc 
tgggcttcat 
tgcggcccta 
tggcgggctc 
tctcctacgg 
acaagttctc 
ggcccccttc 
tggcgggggc 
ttaggaagaa 
acaagcctcc 
agcctgagaa 
ggggccggca 
aactggactg 
gcccgatcag 
atctgcggtg 
gctcccaggt 
caacgtgact 



ggcggaacca 
caaggcccct 
cttctcctac 
tgggatcggc 
gacaggtccg 
ggggggtctc 
ggtctacctc 
gctcatcatg 
agcaggtagc 
ctacggctgg 
tgccgcactc 
ctcccggcgc 
ggccgcctcg 
caaggacctg 
tgacatcttc 
ctccgtctac 
tacggcgggc 
catggtgggg 
cagtgccatt 
gggaggcaaa 
cgaggtgctc 
gcccaaagag 
tgcagactcg 
aaacggggag 
ggcacaggga 
gctcaggcag 
tgttttgagg 
aagccaagcc 
ggcctgcggc 
ttaatgggag 



accctcctgg 60 

gacggcggct 12 0 

gccttcccca 180 

tacagcgaca 2 40 

ctctgcagtg 3 00 

-bttgcgtcgc 3 60 

accactgggg 42 0 

ctgaaccgct 480 

cctgtcttcc 540 

oggggcggc t 6 0 0 

atgaggcccc 660 

ctgctagacc 72 0 

^•tcatggtgc 7 80 

^gcgtgcccg 840 

cjcgcggccgg 9 00 

ctcttcagct 960 

gactacggcg 102 0 

gccctgcagt 10 80 

^gcctggtgc 1140 

otcctggatg 12 00 

acctcctccc 1260 

ccacagcctg 132 0 

Q-gggtggact 13 80 

gtggttcaca 1440 

<3*gaggtacag 1500 

ggccacggct 1560 

g-ggaaggtgg 162 0 

gcaaggttac 1680 

cactgctatg 1740 

ggtgggtggg 1800 
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ccgcagacag gctg-gcaggg caggtgctgc gtggggccct ctccagcccg tcctaccctg 1860 

ggctcacatg gggcctgtgc ccacccctct tgagtgtctt ggggacagct ctttccaccc 192 0 

ctggaagatg gaaataaacc tgcgtgtggg tggagtgttc tcgtgccgaa ttcaaaaagc 1980 
tt 1982 

<210> 140 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0042 0 r 7 



<400> 140 

cctcttgagt gtcttgggga cagctctttc cacccctgga agatggaaat aaacctgcgt 

<210> 141 
<211> 2054 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0042 0 9 



60 



<400> 141 

cgggaggcgg 

gcggcctcgg 

cccgcccggg 

ggaccccgtg 

ctccatcgcc 

cgagctgcgc 

cctcggagcc 

cagcagcgtc 

gtccttcctg 

agggccggcc 

ctccatcctc 

cgacatgtca 

ctatccggtg 

gaccctggac 

agaccagggc 

gggtccttcc 

aagagggggt 

cctggcccca 

gcctggaatg 

gggttggggg 

ggcaggggaa 

catgtccctg 

cactctccac 

tggtccagtc 

ccggcccctg 

agacaccagc 

gggtgccagg 

cgtgccccac 

tgcctctcca 

ctgactactc 

tgccacattt 

acattgtagc 

atatatcact 

tccgactgca 

aaaaaaaaaa 



cagcggctgc 
gcgg-ggccgg 
ccggccatgg 
agctttgcgc 
gtcttcgggc 
tgcgtgttca 
ttcctcgcct 
cgcgaccgcc 
tggttcgtgg 
acgacgcagg 
age fcgggtgg 
etc ttcgeca 
ggcagcggcg 
accagcccca 
tccaaggcca 
agctcagtgc 
ggacccgcgt 
ggactgaggt 
gactggcctg 
tccagctgcc 
agacaccacc 
tggg-tagtga 
tttctctcac 
ttcg-ggtttc 
cccaggtgtg 
cctgtcctag 
caagacaagc 
aagatggccc 
cctgcaccct 
atgeattget 
catcagacag 
ctagaccaat 
ctctccctct 
ccttctttac 
aaaa 2054 



agcgttggta 
ccggacggac 
agggcgcctc 
ggcggcccca 
ccatcgtcaa 
aegggaaege 
gcgccgcctt 
ggcgcgcggt 
gettctgett 
egggggaege 
cgctcaccgt 
ccgaacagct 
tggagggcac 
aagggtacca 
ccccaccaac 
cgeggacaga 
gtctgggctg 
cctgagaagg 
gggaaggctt 
ctctacgatc 
ctcgccccaa 
ctgtctcgtt 
aggctgetag 
acctcctagt 
ggtggttctg 
cccttcagta 
ccctcagcag 
ctgtgtggtt 
gcttcctggc 
caaagctggc 
acacctccct 
tctgtgtgga 
cctgaaagac 
aataaagact 



gcatcagcat 
aggeggacag 
cttcggcgcg 
gaccctgctc 
cgagggctac 
gggcgcctgc 
cctgctgctc 
gttgctggac 
cctcaccaat 
ggcgcgggcc 
gaaggccctg 
gagcaceggg 
cgagacctac 
ggtgcccgcc 
gcaggcccca 
gtaggtggcc 
cccctgccaa 
ggatagcact 
tcccctcttg 
aggtgcaggg 
gactggggat 
tctgtcatgg 
aacagcccag 
actccacaag 
gecaggaagg 
agaccttgcc 
gagagaggee 
ccctttacct 
ccagtcccag 
ttttcacatt 
ctggagatgc 
tatttaagtg 
cagagattgt 
gtaactgagc 



cagcatcagc 
aaggcgccag 
ggccgcgcag 
egggtcgegt 
gtgaacaccg 
cgcttcggcg 
gatgtgcgct 
ctgggcttct 
cagtggcagc 
gccatcgcct 
cagcggttcc 
gegagecagg 
cagagcccgc 
tactagegge 
gggtctcegg 
gctttgcgcc 
gttcccccag 
gcccaggacg 
ggccacacct 
gctgcccagg 
cctggccact 
tggtgcgtcc 
ccctgtcagt 
ctgctcctct 
cacaaggtag 
aggagaggag 
cagaggctcc 
tggcttcctg 
gttggagtcc 
aagtcaacac 
agttgagtga 
aacatgttta 
gtattttcag 
tgactgtgaa 



gcjcagcggca 6 0 

gcjgcgcgcgt 120 

gggccgccct 180 

cctgggtgtt 240 

acagcggccc 3 00 

tcgcgctggg 3 60 

tccagcaaat 42 0 

caggactctg 480 

gcacggcgcc 540 

tcagcttctt 600 

gcctgggcac 660 

cctaccccgg 72 0 

ccttcaccga 780 

tcjgcaggcac 840 

gacctccctt 900 

acccggggcc 9 60 

tccctcagca 1020 

tcjtgtcccta 1080 

gctcactctg 1140 

acaaageggg 12 00 

gttcccatcc 12 60 

ccjtccggagc 132 0 

gttgtgatca 13 80 

ctctgtggcc 1440 

ctgtgggcca 1500 

aaggatgect 1560 

acjctggccac 162 0 

gcccagtccc 1680 

etctgeatag 1740 

caaacgtggt 1800 

caaccttgtt 1860 

ca-atttttgt 1920 

tcrtcccatgt 1980 

aaaaaaaaaa 2 040 
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<210> 142 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0O42O9 
<400> 142 

gatgcagttg agtgacaacc ttgttacatt gtagcctaga ccaattctgt gtggatattt 6 0 

<210> 143 
<211> 1224 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM„004217 
<400> 143 

ggccgggaga gtagcagtgc cttggacccc agctctcctc cccctttctc 
gcccagaagg agaactccta cccctggccc tacggccgac agacggctcc 
agcaccctgc cccagcgagt cctccggaaa gagcctgtca ccccatctgc 
atgagccgct ccaatgtcca gcccacagct gcccctggcc agaaggtgat 
agtgggacac ccgacatctt aacgcggcac ttcacaattg atgactttga 
cctctgggca aaggcaagtt tggaaacgtg tacttggctc gggagaagaa 
atcgtggcgc tcaaggtcct cttcaagtcc cagatagaga aggagggcgt 
ctgcgcagag agatcgaaat ccaggcccac ctgcaccatc ccaacatcct 
aactattttt atgaccggag gaggatctac ttgattctag agtatgcccc 
ctctacaagg agctgcagaa gagctgcaca tttgacgagc agcgaacagc 
gaggagttgg cagatgctct aatgtactgc catgggaaga aggtgattca 
aagccagaaa atctgctctt agggctcaag ggagagctga agattgctga 
tctgtgcatg cgccctccct gaggaggaag acaatgtgtg gcaccctgga 
ccagagatga ttgaggggcg catgcacaat gagaaggtgg atctgtggtg 
ctttgctatg a.Q-ctgctggt ggggaaccca ccctttgaga gtgcatcaca 
tatcgccgca tcgtcaaggt ggacctaaag ttccccgctt ctgtgcccac 
gacctcatct ccaaactgct caggcataac ccctcggaac ggctgcccct 
tcagcccacc cttgggtccg ggccaactct cggagggtgc tgcctccctc 
tctgtcgcct gatggtccct gtcattcact cgggtgcgtg tgtttgtatg 
tataggggaa agaagggatc cctaactgtt cccttatctg ttttctacct 
ttaataaagg ctgaagcttt ttgt 1224 

<210> 144 

<211> 60 

<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004217 

<400> 144 

gtctgtgtat g-tatagggga aagaagggat ccctaactgt tcccttatct gttttctacc 60 

<210> 145 
<211> 983 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004335 



tctaaggatg 60 

atctggcctg 120 

acttgtcctc 180 

ggagaatagc 240 

^•attgggcgt 3 00 

aagccatttc 3 60 

ggagcatcag 42 0 

gcgtctctac 480 

ccgcggggag 540 

cacgatcatg 6 00 

cagagacata 66 0 

cttcggctgg 72 0 

ctacctgccc 780 

cattggagtg 840 

caacgagacc 9 00 

c^ggagc c c ag 9 6 0 

ggcccaggtc 102 0 

tgcccttcaa 1080 

tctgtgtatg 1140 

cctcctttgt 1200 
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<400> 145 

gtggaattca 

aagcgctgta 

ggggtgccct 

cgggcagtga 

cagaagggct 

ctaatggctt 

ggagagatca 

agaagagaaa 

caggactcca 

ctgctgcagt 

tcgcttgaac 

cggggagagc 

gcagtcctgg 

cctccggaca 

gtggggggca 

tggggtcttt 

aaaaaaaaaa 



tggcatctac 
agcttctgct 
tgcLttatctt 
tgg-agtgtcg 
ttcaggatgt 
ceo tggatgc 
ctacattaaa 
accaggtctt 
gctccgctgc 
gagatcccag 
attcccttga 
aeggggtage 
gtgtggggac 
atgagtcccc 
tgtgctgcct 
gagctccaaa 
aaaaaaaaaa 



ttegtatgae 
ggggatagga 
caccatcaag 
caatgtcacc 
ggaggcccag 
agagaaggee 
ccataagctt 
aagcgtgaga 
ggcgccccag 
gaagctggca 
tctcatcagt 
eggagaaggg 
acagtegggt 
cctcttgtct 
gttgttatgg 
aaataaacac 
aaa 9 83 



tattgeagag 
attctggtgc 
gccaacagcg 
catctcctgc 
gccgccacct 
caaggacaaa 
caggacgegt 
ategeggaca 
ctgctgattg 
catcttggaa 
tetgageggg 
cctctggagc 
tgacccaggg 
cccaccctga 
gttttttttg 
ttcctttgag 



tgcccatgga 
tcctgatcat 
aggcctgccg 
aacaagagct 
gcaaccacac 
agaaagtgga 
ctgcagaggt 
agaagtacta 
tgctgctggg 
ggtccgtcct 
tcatggggca 
aggtctggag 
ctgtctccct 
gattgggcat 
eggggggggt 
ggagagcaaa 



a<gacggggat 60 

c^tgattctg 120 

gcgacggcctt 180 

gaccgaggcc 240 

tcjtgatggcc 3 00 

gcg-agcttgag 3 60 

g^agegactg 42 0 

ccccagctcc 480 

cctcagcgct 540 

geteggcttt 600 

acaeggttag 660 

g^gccatggg 72 0 

ccagagcctc 7 80 

gg-ggtgcggt 840 

tgetttttte 900 

a aaaaaaaaa 9 6 0 



<210> 146 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0043 3 5 



<400> 146 

ggttgctttt ttctggggtc tttgagctcc aaaaaataaa cacttccttt gagggagagc 
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<210> 147 
<211> 3446 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_00433 6 



<400> 147 

ttctagtttg 

ggaaaatgtc 

tggtgaatgg 

cttgataact 

tgacccaaga 

attttttgag 

ctgggcgggg 

aggaattcaa 

gacacgcctc 

tgttcaggtt 

catttctaag 

gtcaaatatg 

tttggcatcc 

tggggaatca 

gcatgagcaa 

tgaagaacag 

ggagacatcc 

tatggggeca 

tcagcagaca 

tttggcaaat 

tcctgttcct 

tgctggatgt 



cggttcaggt 
ettcagatge 
gaaagataca 
ttactagaac 
ttcatcagtt 
tttctgtaca 
catctggaag 
aaccaggctg 
actgaaaccc 
ttaaatcaaa 
aatcagggtt 
gaacgaagag 
aaagttgatg 
gaattttcct 
tgggtaaatg 
ctattaaaac 
catgaggatc 
agtgtaggct 
ccagtgaaca 
gctatttctg 
ttgaaagece 
gtgaataaga 



ttgccgctgc 
ttgaagecca 
tacagtgggt 
atttaatgaa 
attgtttaaa 
accatgggat 
cccaaggaga 
aacccagaga 
atttgecage 
tgataacatc 
cagagctttc 
tgatcacgat 
ttgagcaggt 
ttgaagaatt 
aagacagaca 
agaaaatgga 
tgcccgcttc 
cccagcagga 
tggaaaagaa 
cagctttggt 
agacagtaac 
gtactcatga 



cggccagcgt 
catgeagage 
agaagagaat 
ggaattttta 
atttgetgag 
tggaaccctg 
getgeagcat 
gttcctgcaa 
tcaagctaga 
aaaatcaaat 
tggagtgata 
ttctaaatca 
tgttatgtat 
gagageccag 
ttatatgaaa 
tgaacttcat 
ccaggaaagg 
actgagagcg 
cccaagagag 
gtccccagcc 
agactccatg 
attcaageca 



cctctggcca 
tacaagggca 
tttcctgaga 
gataagaaga 
tacaacagtg 
tcatcccctc 
gccagtgctg 
caacaataca 
acctcagaac 
ccaggaaata 
tcttcagctt 
gaatattctg 
tgcaaggaga 
aaatacaatc 
aggaaagaag 
aagaagttgc 
tccgaggtta 
ccatgtcttc 
gcacctcctg 
accagccaga 
tttgcagtgg 
cagagtggag 



tggacacccc 6 0 

atgaccctct 12 0 

ataaagaata 180 

aataccacaa 240 

acctccatca 300 

tgtacattgc 3 60 

tccttcagag 42 0 

cfgttatttca 480 

etctgeataa 540 

a_catggcctg 6 00 

gj-fcgataaaga 660 

tgcactcatc 72 0 

agcttattcg 780 

aacggagaaa 840 

caaatgettt 900 

a.tcaggtggt 9 60 

atccagcacg 102 0 

cagtaaccta 1080 

ttgttcctcc 1140 

gcattgctcc 12 00 

ccagcaaaga 12 60 

cagagatcaa 132 0 
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agaagggtgt 
atcactggga 
aaaagaagca 
tgatgacaaa 
tcaaaaaaat 
atctgctttt 
aaataaaccc 
accaaaggag 
tcgctgcaac 
acaacttgcg 
taaagaaaat 
catggtggtg 
ggccttgtcg 
tggggtactt 
cgctgccatt 
gatgagttca 
taagctgatt 
ttttgaatgg 
taagctggtc 
tacccaggga 
gcctgccaac 
tatgcagcac 
agtaggagag 
ccctgaaaaa 
gattgagcaa 
acttggaaac 
gattgacctg 
aaagtgtgaa 
ccagatcgat 
gaaagtgaaa 
tttggatatg 
tccatctttg 
caagattagg 
aaaataaaat 
ctttaaacct 
atggaatatt 



gaaacacata 
sitggttcagg 
ttaggtttca 
cjatgaatggc 
cjtaaggtcat 
catgtgtttg 
acaggagcca 
gaagtgcctc 
aaaaccctgg 
tctacaccat 
g-tggtagcaa 
ccttcaaggg 
tctcacatgt 
aicctgtgagg 
gcagaagatc 
cttgggactg 
ttcaaacttt 
caatgtaaac 
tatgtccatc 
gatctgaatg 
ccctgggaat 
sitgtttatga 
ctctacagct 
gtgatgcctc 
g-tgcatgact 
Q-gatttttgg 
g-gtcagagta 
acatctggtt 
tactttgggg 
aatgaaggag 
tggaatgaat 
g-atttgttaa 
gccctacgta 
ttggatatag 
^ttttttttt 
tccatgtaaa 



aggttgccaa cacaagttct 
caacgccatc caaagtgcag 
tcatgaatat gtttcaggct 
aatctctaga tcaaaatgaa 
ctggggcttg gggagtcaat 
aagatggaaa caaagaaaat 
ggacctttgg agaacgctct 
atgctgaaga gtttttggat 
cacccagtcc taagagccca 
tccacaagct tccagtggag 
aacagtgtac ccaggcgact 
atggaaaatt cagtccaatt 
attcagcatc cttacttcgt 
cagagttggg cgttgaggct 
caccagatgc tattgctggg 
ttgatgctcc aaacttcatt 
tatctgggct ttctaaacca 
ttccagccat caagcccaag 
accttcttgg agaaggagcc 
atgctaaaaa taaacagaaa 
tctacattgg gacccagttg 
agttctattc tgcccactta 
atggaacatt attaaatgcc 
aaggtcttgt catctctttt 
gtgaaatcat tcatggagac 
aacaggatga tgaagatgat 
tagatatgaa actttttcca 
ttcagtgtgt tgagatgctc 
ttgctgcaac agtatattgc 
gagagtgtaa gcctgaaggt 
tttttcatgt tatgttgaat 
ggcaaaagct gaagaaagta 
ataggctaat tgtactgctc 
acagtcctta aaaatcacac 
catttattgt ttatgtaaat 
aaaaaa 3446 



tttcacacaa 
ccatcaccca 
cctacacttc 
gatgcatttg 
aagat catct 
tatgg-attac 
gtcag-cagac 
gactcaactg 
ggaga_cttca 
tcagtgcaca 
ttgga_ttctt 
caaga gaaaa 
ctgagccagc 
tgcag-actca 
ctccaagcag 
gttgg-gaacc 
gtgag-ttcct 
actga_atttc 
tttgc ccagg 
tttgt tttaa 
atggaaagac 
ttccagaatg 
attaacctct 
gctat gagaa 
attaaaccag 
ttatctgctg 
aaagg-aacta 
agcaa-caaac 
atgct ctttg 
ctttt tagaa 
attccagatt 
tttcaacaac 
ttaga.a.tgta 
tgtaa_atatg 
gtttg-ttaaa 



ctccaaacac 
ccgtgcacac 
ctgatatttc 
aagcccagtt 
cttctttgtc 
cacagcctaa 
ttccttcaaa 
tatggggtat 
catctgctgc 
ttttagaaga 
gtgaggaaaa 
gcccaaaaca 
ctgctgcagg 
cagacactga 
aatggatgca 
catgggatga 
atccaaatac 
aattgggttc 
tgtacgaagc 
aggtccaaaa 
taaagccatc 
gcagtgtatt 
ataaaaatac 
tgctttacat 
acaatttcat 
gcttggcact 
tattcacagc 
catggaacta 
gcacttacat 
ggcttcctca 
gtcatcatct 
actatactaa 
agcgttcacg 
aatctgctca 
aataaatccc 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



<210> 148 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM 004=336 



<400> 148 

ttagggccct acgtaatagg ctaattgtac tgctcttaga atgta_a.gcgt tcacgaaaat 

<210> 149 
<211> 739 
<212> DNA 

<213> Homo sapiens 



60 



<300> 

<308> NM 004345 



<400> 149 

taaagcaaac cccagcccac accctggcag gcagccaggg atggg*"tggat caggaaggct 60 

cctggttggg cttttgcatc aggctcaggc tgggcataaa ggag/g-otcct gtgggctaga 12 0 

gggaggcaga catggggacc atgaagaccc aaagggatgg ccactccctg gggcggtggt 180 

cactggtgct cctgctgctg ggcctggtga tgcctctggc catcattgcc caggtcctca 240 
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gctacaagga a_g-ctgtgctt cgtgctatag atggcatcaa ccagcggtcc tcggatgcta 3 00 

acctctaccg cctcctggac ctggacccca ggcccacgat ggatggggac ccagacacgc 3 60 

caaagcctgt gagcttcaca gtgaaggaga cagtgtgccc caggacgaca cagcagtcac 420 

cagaggattg tgacttcaag aaggacgggc tggtgaagcg gtgtatgggg acagtgaccc 480 

tcaaccaggc caggggctcc tttgacatca gttgtgataa ggataacaag eigatttgccc 540 

tgctgggtga tttcttccgg aaatctaaag agaagattgg caaagagttt aaaagaattg 600 

tccagagaat caaggatttt ttgcggaatc ttgtacccag gacagagtcc fcagtgtgtgc 660 

cctaccctgg ctcaggcttc tgggctctga gaaataaact atgagagcaa tttcaaaaaa 72 0 

aaaaaaaaaa aaaaaaaaa 73 9 



<210> 150 
<211> 60 
<212> DNA 
<213> Homo 



s apiens 



<300> 

<308> NM_J)04345 



<400> 150 

gcaaagagtt taaaagaatt gtccagagaa tcaaggattt tttgcggaat cttgtaccca 6 0 

<210> 151 
<211> 1432 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004577 



<400> 151 

gaggaaaatt 

agatgctgtg 

agccaaaatc 

ggcagtgcct 

gcaggtgcag 

ggtaagtcgc 

tattgtagag 

gctgaaattc 

atctggtgga 

aatcatgatt 

tggatttgga 

tgattttgta 

acaacttcag 

attacaactt 

tcctagaaga 

tgatttcctg 

ggagtgcagt 

tcctgcctca 

tttttgtatt 

tgaccttgtg 

cgcgcccagc 

tatatttcat 

atacggattc 

atccgtatgt 



c ttccagcga 
t cjttttgatg 
tQ"tggcgttg 
ttcaaagctg 
agactcatag 
c tacaggagc 
catgttgctt 
tactttaacg 
aaaggaaaag 
g- gagatggtg 
cj-g-aaatgtga 
g-a.gctgctgg 
a_tgaattttt 
gctatagaaa 
ttgctttttt 
g-a.gaggt 1 1 1 
cfg-cgcggtct 
gcctcccgag 
tttagtagag 
atccgcctgc 
caatgtccta 
acaaagtgct 
cj-g-cagccttc 
cagaacggat 



tggtctccca 
ttgacagcac 
aggacgcggt 
ctctcacaga 
cagagcaacc 
gaaatgttca 
caaagctcaa 
gtgaatatgc 
tgattaaact 
ccacagatat 
tcaggcaaca 
gagaactgga 
acaagttata 
gttggtacaa 
ttttttttta 
tttttttttt 
cggctcactg 
tagctgggac 
acggggtttg 
ctcagcctcc 
gagagttttg 
ttaagtgtgg 
agtgaatatt 
ttgtggaaaa 



ctcagagctg 
ggtcatcaga 
gtcagaaatg 
gcgcttagcc 
cccacacctg 
ggttttccta 
tatcccagca 
aggttttgat 
tttaaaggaa 
ggaagcctgt 
agtcaaggat 
agaataacat 
cagattgata 
atgatctgta 
actgtagttc 
gagacagaat 
caagctctgc 
tacaggcacc 
accgtgttag 
caaagtgctg 
tgatctgaat 
agagtcaatt 
ggtttctctt 
aaaaaaaaaa 



aggaagcttt 
gaagaaggaa 
acacggcgag 
ctcatccagc 
acccccggca 
atatctggtg 
accaatgtat 
gagacgcagc 
aaatttcatt 
cctcctgctg 
aacgccaaat 
ccattgtcgt 
ctgtttgctt 
ctttaaacta 
cagtattata 
cttgctctgfc 
ctcccaggtt 
cgccaccaca 
ccaggatggt 
ggattacagg 
tctttatgta 
aaacaccttt 
tggtatgtca 
aaaaaaaaaa 



tctactcagc 
tcgatgagct 
ccatgggcgg 
cctccaggga 
taagggagct 
g*ctttaggag 
ttgccaatag 
caacagctga 
ttaagaaaat 
SLtgctttcat 
cg-gtatatcac 
acagctccaa 
acagttgcct 
cagttaggaa 
tgatgactat 
tgcccaggct 
cacgccattc 
tccggctaat 
cttgatctcc 
cttgggccac 
tatttgtagc 
actcttagaa 
sitaaaag-ttt 
SL3L 1432 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 



<210> 152 
<211> 60 
<212> DNA 

<213> Homo sapiens 



<300> 
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<308> NM_004577 



<400> 152 

tagaaatacg gattcggcag ccttcagtga atattggttt ctctttggta tgtcaataaa 60 



<210> 153 
<211> 1530 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_004=701 



<400> 153 

aatcctggaa 

ctaacggcgc 

gctggcactc 

tgatttggag 

gcgaactgtt 

gaaagctcag 

actgaaacct 

ttctcccaca 

tgccttgctc 

cagtgactac 

aaacccacat 

ttggctggta 

tggcattatg 

tgggattact 

agactttgtt 

tctaattttg 

gcgagcatca 

ggagctgact 

tgcttcctgc 

ttacacagga 

ggtgaaagta 

caaactcctg 

ctccccactg 

gccctttttc 

tgaaacctct 

ttttcttaag 



caaggctaca 
ctcgtacgct 
ttgccttccc 
aatattgaca 
t tagaagaaa 
aacaccaaag 
actgcttctg 
cctgaggatg 
fcgcaaaatcg 
g-ttaaggata 
ttcttagatg 
caagtccact 
O^tcgatttt 
gctctgctct 
tacatcacag 
aaagaat t ga 
aaagccgggg 
ctcatcgact 
ttgtctcaga 
tacacagaga 
aatgaaaact 
aagatcagca 
ataggaaggt 
fctattggttt 
tctcagacca 
gtaaaaaaaa 



gcgtcgaaga 
agtgtcctcc 
cgtccctcat 
caggagttaa 
ttggaaatag 
ttccagttca 
tcaaaccagt 
tctccatgaa 
aggacattga 
tctatcagta 
gaagagatat 
ccaagtttag 
tacaggttca 
tggcttccaa 
acaatgctta 
aatttgagtt 
aggttgatgt 
atgatatggt 
aggttctagg 
atgaagtatt 
taactaaatt 
tgatccctca 
cctaggctgc 
agaactcttg 
gttttctaaa 
aaaaaaaaaa 



tccccagcgc 
cttttcagtc 
ggcgctgctc 
ttctaaagtt 
agttacaacc 
acccaccaaa 
acagatggaa 
ggaagagaat 
taacgaagat 
tctcaggcag 
aaatggacgc 
gcttctgcag 
gccagtttcc 
gtatgaggag 
taccagttcc 
gggtcgaccc 
tgaacagcac 
gcattatcat 
acaaggaaaa 
ggaagtcatg 
catcgccatc 
gctgaactca 
cgtgggccct 
attttgtaca 
catatattga 
1530 



tgcgggctcg 
cgcgtccctc 
cgacgcccga 
aagagtcatg 
agagcagcac 
acaacaaatg 
aagttggctc 
ctctgccaag 
tgggagaacc 
ctggaggttt 
atgcgtgcca 
gagactctcjt 
cggaagaagc 
atgttttcfcc 
caaatccgag 
ttgccactac 
actttagccia 
ccttctaagg 
tggaacttaa 
cagcacatcjg 
aagaataagt 
aaagccgtca 
ggggatgtg-t 
tagtcctctg 
ggaaaaataa 



gagagcagtc 
cctgggccgg 
cggtgtccag 
tgactattag 
aagtagctaa 
tcaacaaaca 
caaagggtcc 
ctttttctga 
ctcagctctg 
tgcagtccat 
tcctagtgga 
acatgtgcgt 
ttcaattagt 
caaatattga 
aaatggaaac 
acttcttaag 
agtatttgat 
tagcagcagc 
agcagcagta 
ccaagaatgt 
atgcaagcag 
aagaccttgc 
gcttcattgt 
gtctatctca 
agcgattggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 



<210> 154 
<211> 60 
<212> DNA 
<213> Homo 



sapiens 



<300> 

<308> NM_004701 



<400> 154 

agaactcttg attttgtaca tagtcctctg gtctatctca tgaaacctct tctcagacca 60 



<210> 155 
<211> 2536 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NMJD04702 



<400> 155 
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agcgggtgcg gggcgggacc ggcccggcct atatattggg ttggcgccgg 

gccgagcggt agctggtctg gcgaggtttt atacacctga aagaagagaa 

aagtagccgt ttacaagcta agcagcagcc ccagcccagc cagacggaat 

agcccagata atccaggcca agaagaggaa aactacccag gatgtcaaaa 

taaacatgtt aaaaaaggag agcagatatg ttcatgacaa acattttgaa 

ctgacttgga accacagatg aggtccatac ttctagactg gcttttagag 

tatacacact tcatagggaa acattttatc ttgcacaaga cttttttgat 

tgacacaaaa ggatataaat aaaaatatgc ttcaactcat tggaattacc 

ttgcttccaa acttgaggaa atctatgctc ctaaactcca agagtttgct 

atggtgcttg cagtgaagag gatatcttaa ggatggaact cattatatta 

aatgggaact ttgtcctgta acaatcatct cctggctaaa tctctttctc 

ctcttaaaga tgctcctaaa gttcttctac ctcagtattc tcaggaaaca 

tagctcagct tttagatctg tgtattctag ccattgattc attagagttc 

tactgactgc tgctgccttg tgccatttta cctccattga agtggttaag 

gtttggagtg ggacagtatt tcagaatgtg tagattggat ggtacctttt 

taaaaagtac tagtccagtg aagctgaaga cttttaagaa gattcctatg 

ataatatcca gacacataca aactatttgg ctatgctgga ggaagtaaat 

ccttcagaaa agggggacag ttgtcaccag tgtgcaatgg aggcattatg 

agagcactga aaaaccacca ggaaaacact aaagaagata actaagcaaa 

ttcaccaaga ttgggtagaa ctggtatcac tgaactacta aagttttaca 

ctgtgattga ttgccctagc caattcacaa gttacactgc cattctgatt 

caattggcac taaagaatac atttaattat ttcctatgtt agctgttaaa 

gacttgttta caaagatgtc ttcattccca aggttactgg atagaagcca 

ataccatagc aatgtttttc ctttaatcca gtgttactgt gtttatcttg 

aattttgtca ctggagtttt ggactggata agtgctacct taaagggtat 

acagtacttt gaatctagtt gttagattct caaaattcct acactcttga 

ttggttcttg aaaattaaat ttaaacttgt ttacaaaggt ttagttttgt 

ctaatttatc tatagctgct atagcaagct attataaaac ttgaatttct 

aaatttaatg ttttttaaac tagtttattt gccttgccat aacacatttt 

aggcttagat gaacatggtg ttcaacctgt gctctaaaca gtgggagtac 

ataaacaaga taaatgctgt ggctccttcc taactggggc tttcttgaca 

ttggtaataa octttttgta tatcacaatt tgggtgaaaa acttaagtac 

tatttatatg aggaagtcac tttactactc taagatatcc ctaaggaatt 

atttagtgtg actaaggctt tatttatgtt tgtgaaactg ttaaggtcct 

ctccattgtg agataaggac agtgtcaaag tgataaagct taacacttga 

tattttctta aggaagaaga gtattaaata tatactgact cctagaaatc 

aaaaagacat gaaaacttgc tgtacatagg ctagctattt ctaaatattt 

tttctaaaaa aaaaatccag cctcataaag tagattagaa aactagattg 

ttgttatcag eitatgtgaat ctcttctccc tttgaagaaa ctatacattt 

tatgaagtct tctgtatagt ttgtttttaa actaatattt gtttcagtat 

aagaaaacac cactaattgt gtacatatgt attatataaa cttaaccttt 

tatttttagc ccattgttta aaaaataaaa gttaaaaaaa tttaactgct 

aaaaaaaaaa aaaaaa 253 6 

<210> 156 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0 0 47 02 
<400> 156 

gtttgtgaaa ctgttaaggt cctttctaaa ttcctccatt gtgagataag gacagtgtca 60 

<210> 157 
<211> 1491 
<212> DNA 

<213> Homo sapiens 
<300> 



cgccagctga 60 

tgtcaagacg 12 0 

ccccccaaga 180 

gaagtctggc 2 40 

gttctgcatt 300 

gtatgtgaag 3 60 

agatttatgt 42 0 

tcattattca 480 

tacgtcactg 540 

aaggctttaa 600 

caagttgatg 660 

ttcattcaaa 720 

cagtacagaa 7 80 

aaagcctcag 840 

gtcaatgtag 900 

gaagacagac 960 

tacataaaca 102 0 

acaccaccga 1080 

caagttggaa 114 0 

gaaagtagtg 12 00 

ttaaaactta 1260 

gaaacagcag 132 0 

accacagtct 13 80 

ataaactagg 1440 

actaagtgat 1500 

ctagtgcaat 1560 

aataaggtga 162 0 

acaaatggtg 1680 

ttaactaata 1740 

caaagaaatt 1800 

tgtaggttgc 1860 

cctttcaaac 192 0 

ttttttttta 1980 

ttctaaattc 2040 

cctaaacttc 2100 

tatttattaa 2160 

taaattagct 2220 

ctagtttatt 2280 

attgttacgg 23 40 

tttgtctgaa 2400 

taatactgtt 2460 

taaaagtaaa 252 0 
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<308> NM_O04710 
<400> 157 

gcg-gcggcgg- cagcggcggc gacggcgaca tggagagcgg ggcctacggc gcggccaagg 6 0 

cgggcggctc cttcgacctg cggcgcttcc tgacgcagcc gcaggtggtg gcgcgcgccg 12 0 

tgtgcttggt cttcgccttg atcgtgttct cctgcatcta tggtgagggc tacagcaatg 180 

cccacgagtc taagcagatg tactgcgtgt tcaaccgcaa cgaggatgcc tgccgctatg 240 

gcagtgccat cggggtgctg gccttcctgg cctcggcctt cttcttggtg ^tcgacgcgt 3 00 

atttccccca gatcagcaac gccactgacc gcaagtacct ggtcattggt gacctgctct 3 60 

tctcagctct ctggaccttc ctgtggtttg ttggtttctg cttcctcacc aaccagtggg 420 

cagtcaccaa cccgaaggac gtgctggtgg gggccgactc tgtgagggca gccatcacct 480 

tcagcttctt ttccatcttc tcctggggtg tgctggcctc cctggcctac cagcgctaca 540 

aggctggcg--fc ggacgacttc atccagaatt acgttgaccc cactccggac cccaacactg 600 

cctacgcctc ctacccaggt gcatctgtgg acaactacca acagccaccc ttcacccaga 6 60 

acgcggagac caccgagggc taccagccgc cccctgtgta ctgagcggcg ^ttagcgtgg 72 0 

gaagggggac agagagggcc ctcccctctg ccctggactt tcccatgagc otcctggaac 7 80 

tgccagcccc tctctttcac ctgttccatc ctgtgcagct gacacacagc taaggagcct 840 

catagcctg-g- cgggggctgg cagagccaca ccccaagtgc ctgtgcccag agggcttcag 9 00 

tcagccgctc actcctccag ggcattttta ggaaagggtt ttcagctagt ^tttttcctc 960 

gcttttaatQ- acctcagccc cgcctgcagt ggctagaagc cagcaggtgc ccatgtgcta 102 0 

ctgacaagt^ cctcagcttc cccccggccc gggtcaggcc gtgggagccg ctattatctg 1080 

cgttctctg-c caaagactcg tgggggccat cacacctgcc ctgtgcagcg gagccggacc 1140 

aggctcttg-t gtcctcactc aggtttgctt cccctgtgcc cactgctgta tgatctgggg 12 00 

gccaccaccc tgtgccggtg gcctctgggc tgcctcccgt ggtgtgaggg cggggctggt 12 60 

gctcatggca cttcctcctt gctcccaccc ctggcagcag ggaagggctt tgcctgacaa 132 0 

cacccagctt tatgtaaata ttctgcagtt gttacttagg aagcctgggg agggcagggg 1380 

tgccccatg-^ ctcccagact ctgtctgtgc cgagtgtatt ataaaatcgt gggggagatg 1440 
cccggcctg-g- gatgctgttt ggagacggaa taaatgtttt ctcattcagt a 1491 

<210> 158 
<211> 60 
<212> DNA. 

<213> Homo sapiens 
<300> 

<308> NM_004710 
<400> 158 

ttgcctgaca acacccagct ttatgtaaat attctgcagt tgttacttag gaagcctggg 6 0 

<210> 159 
<211> 332 4 
<212> DNA. 

<213> Homo sapiens 
<300> 

<308> NM_004856 
<400> 15^ 

gcagagcaoc gcgccttagc cgcgaagttc tagttcttgc tgccggtcct aacgtcccgc 6 0 

agtcttcgcc agccagccgt cccgcatgcg cgtttgggcg gcgtggagcc tgctgccatg 12 0 

aagtcagcg-a gagctaagac accccggaaa cctaccgtga aaaaagggtc ccaaacgaac 180 

cttaaagacc cagttggggt atactgtagg gtgcgcccac tgggctttcc tgatcaagag 240 

tgttgcatcLg; aagtgatcaa taatacaact gttcagcttc atactcctga gggctacaga 3 00 

ctcaaccgaa atggagacta taaggagact cagtattcat ttaaacaagt atttggcact 3 60 

cacaccacoc agaaggaact ctttgatgtt gtggctaatc ccttggtcaa tgacctcatt 42 0 

catggcaaaa atggtcttct ttttacatat ggtgtgacgg gaagtggaaa aactcacaca 480 

atgactggtt ctccagggga aggagggctg cttcctcgtt gtttggacat gatctttaac 540 

agtataggg-t catttcaagc taaacgatat gttttcaaat ctaatgatag gaatagtatg 600 

gatatacag-t gtgaggttga tgccttatta gaacgtcaga aaagagaagc tatgcccaat 660 

ccaaagactt cttctagcaa acgacaagta gatccagagt ttgcagatat gataactgta 72 0 
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caagaattct 
tatattgaaa 
ataaaaccca 
gttgcaggat 
agaggccaga. 
catagcgtgt 
ttacaggaaa 
gaaagaacta 
cagtcactaa 
actaacaaga 
gatggggaag- 
gaaaacttgc 
gtagacaag-g- 
ccagttgga.3. 
tcatgcgaaa 
gagaaacgac 
tttaaagct t 
gggaaactaa 
gaaaagaaaa 
tatgaggaag 
cgacagtttt 
acaatgaagt 
aataaactct 
aaaactgaga 
tctgtttct c 
cgtctccgac 
tctaacatgc 
gttgcaaatg 
gcctccgatg 
ggtggacaa-t 
agtcgaaaac 
tggaccgatg 
ggacctggat 
agtactgaaa 
tcttccagcjt: 
atacactgac 
ccttgtata.^ 
catcactgta 
ctaattcaca 
attttttat t 
ctgctatttei 
ataaagatgt 
tgtatatat t 
gaaataaaaa 



gcaaagcaga 
tatataataa 
aacctccaca 
gtacagaagt 
aaaagagacg 
tcaacattaa 
aagaacaaat 
accggaccag 
tgacgctaag 
tggttccata 
gaaaagtgcg 
aagtcatgag 
caatatgtgg 
atgaaccatt 
ttttggatat 
ataacttacg 
tgttacaaga 
atgaaaagga 
acaaaacttt 
ataaacgcaa 
ctgacaaacg 
gggagaaaga 
gggttaaaga 
agccagagag 
catcacctgt 
acagacgatc 
aaactgaaac 
aaaaggcact 
gggagattga 
ctgttcagtt 
gaagatcttc 
tagaaacaag 
atcagcatca 
gaacattttc 
catcttgtag 
ccagagcaaa 
tatatgtttt 
tgaatttttt 
caagtgtttg 
gtgacttctc 
ttagctgcaa 
aaatgactgg 
cactttctga 
aaaaaaaaaa 



agaggttgat 
ttacatatat 
atctaaattg 
tgaagtgaaa 
tattgctaat 
attagttcag 
cactataagt 
agcagaaggg 
aacatgtatg 
tcgagattca 
gatgatcgtg 
atttgcggaa 
tttaacgcct 
ggttactgac 
caacgatgag 
acaaatgatg 
atttgacaat 
gaagatgatc 
agaatataag 
tttgcaacag 
cagattagaa 
atgtgagcgt 
tgaaaagctg 
accctctcgg 
gcctttactc 
acgctctgca 
agtcatgcag 
agctaagtgt 
aactaaacta 
tactgatatt 
cacagtagca 
gtgttctgtg 
cgcacaaccc 
atttgtgtgg 
aactccagct 
gctttcccta 
gccatattta 
ataatgtttt 
tcttagatga 
cagccctggt 
gaatgcactt 
ccaataataa 
catttagata 
aaaa 332 4 



gaagatagtg 
gatctattgg 
cttcgtgaag 
tctactgagg 
acccatttga 
gctcccttgg 
cagttgtcct 
aacagattac 
gatgtcctaa 
aagttaaccc 
tgtgtgaacc 
gtgactcaag 
gggaggagat 
gtggttttgc 
cagacacttc 
attgatgagt 
gctgttttaa 
tcaggacaga 
attgagattt 
gaacttgaaa 
gccaggttgc 
agagtggcag 
aaacaactga 
gagcgagatc 
tttcaacctg 
ggagacagat 
ccacatgtcc 
gagaagtaca 
attaagggtg 
gagactttaa 
cctgcccaac 
gctgtggaga 
aagcgcaaaa 
atgatttctc 
ttgttgaaaa 
tggttccaaa 
atattaatag 
tttaaaatat 
ttaaggaaga 
ctgaatttct 
tagaactatt 
ccattttagg 
tgccaaaaga 



tctatggtgt 
aagaggtgcc 
ataagaacca 
aggcttttga 
atcgtgagtc 
atgcagatgg 
tggtagatct 
gtgaagctgg 
gagagaacca 
atctgttcaa 
ccaaggctga 
aagttgaaLgt 
acagaaacca 
agagttttcc 
caaggctgat 
ttaacaaaca 
gtaaagaaaa 
aattggaaat 
tagagaaaac 
ctcagaacca 
aaggcatggt 
ccaaacagct 
aggctatfcgt 
gagaaaaagt 
atcagaac gc 
gggtagat ca 
ctcatgccat 
tgctgaccca 
atatttataa 
agcaagaatc 
cagatggfcgc 
tgagagca_gg 
agccatgaac 
gaaagccatg 
tcacggacct 
gacaactagt 
cagaggaaga 
atttcatcjta 
ctatatctag 
taaggtttta 
tgacaattca 
aaggtgtttt 
attaaaatca 



atttgtctct 
gtttgatccc 
taacatgtat 
agttttctgg 
cagccgttcc 
agacaatgtc 
tgctggaagt 
taatattaat 
aatgtatgga 
gaactacttt 
agattatgaa 
agcaagacct 
gcctcgaggt 
acctttgccg 
tgaagcctta 
atctaatgct 
ccacatgcaa 
agaacgactg 
aactactatc 
gaaacttcag 
gacagaaacg 
ggagatgcag 
tactgaacct 
tactcaaaga 
accaccaatt 
taagcccgcc 
cacagtatct 
ccaggaacta 
aacaaggggt 
accaaatggt 
agagtctgaa 
atcccagctg 
tgacagtccc 
ccagaagcag 
cagctacatc 
attcaacaaa 
ctcctttttt 
tacttataaa 
atcatgtctg 
taaacaaatg 
gactttcaaa 
gaattctgta 
aaagcactaa 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 



<210> 160 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> 3SIM__O04856 



<400> 160 

caaagctttc cctatggttc aaagacaact agtattcaac aaaccttgta tagtgtatgt 60 

<210> 161 
<211> 153 6 
<212> DNA 

<213> Homo sapiens 
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<300> 

<308> NM_004900 



<400> 161 

acagagcttc 

ccacagatca 

aacgaaccca 

aggggccgct 

cctcagtacc 

tacaagtgtt 

aagctggccg 

ctctactact 

gcccgcgtga 

aatgaaggtc 

cgcacgctaa 

aataatgacc 

ctggacaatg 

aagaatcttc 

ccttctttgc 

ccctgcttct 

gtgagactgc 

ctgcaaatgc 

tactgctggg 

gaggagcaca 

tgaaggatgg 

gatcttcttc 

ccagcaaagc 

taattgaaaa 

agaatagttt 

cattaagaat 



aaaaaaagag 
gaaatccgat 
tcctctatgg 
caaatctcct 
acgcagaaat 
tccagatcac 
aattcctgtc 
actgggaaag 
cgatcatgga 
agcaattcat 
aggagattct 
ctttggtcct 
gcacctgggt 
t ctgtggctt 
agttggaccc 
cctggggctg 
gcatcttcgc 
tgcgggatgc 
acacctttgt 
gccaagccct 
gcctcagtct 
caagaaatgc 
aatgtgctcc 
tttctcttat 
tcaatgtatt 
cttccataat 



cgggacaggg 
ggagcggatg 
tcggagctac 
ttgggacaca 
gtgcttcctc 
ctggtttgta 
tgagcacccc 
agattaccga 
ctatgaagaa 
gccttggtac 
cagatacctg 
tcgacggcgc 
cctgatggac 
ttacggccgc 
ggcccagatc 
tgccggggaa 
tgcccgcatc 
tggggcccaa 
gtaccgccag 
gagtgggagg 
ctaaggaagg 
aaacagaccg 
tgatcaagta 
gttccaagtg 
aatgaagtga 
tgtttccaca 



acaagcgtat 
tatcgagaca 
acttggctgt 
ggggtctttc 
tcttggttct 
tcctggaccc 
aatgtcaccc 
agggcgctct 
tttgcatact 
aaattcgatg 
atggatccag 
cagacctact 
cagcacatgg 
catgcggagc 
tacagggtca 
gtgcgtgcgt 
tatgattacg 
gtctccatca 
ggatgtccct 
ctgcgggcca 
cagagacctg 
ttcaccacca 
gattttttaa 
tacaagagta 
ttaattggct 
aacact 153 



ctaagaggct 
cattctacga 
gctatgaag-t 
gaggccaggrt 
gtggcaacca 
cctgcccgga 
tgaccatct c 
gcaggctga_g 
gctgggaaaa 
aaaattatgc 
acacattcac 
tgtgctatg-a 
gctttctatg 
tgcgcttcfc. t 
cttggttca_t 
tccttcagga 
accccctata 
tgacctacga 
tccagccctg 
ttctccagaa 
ggttgagca-g 
tctccagctg 
aaatcagagt 
agattatgct 
ccatatttag 
6 



gaacatgaat 
caactttgaa 
gaaaataaag 
gtatttcaag 
gctgcctgct 
ctgtgtggcg 
tgccgcccgc 
tcaggcagga 
ctttgtgtac 
attcctgcac 
tttcaacttt 
ggtggagcgc 
caacgaggct 
ggacctggtt 
ctcctggagc 
gaacacacac 
taaggaggcg 
tgagtttgag 
ggatggacta 
tcagggaaac 
cagaataaaa 
ctcacagaca 
caattaattt 
caatattccc 
actaataaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 



<210> 162 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NMJD04900 
<400> 162 

tgctcacaga caccagcaaa gcaatgtgct cctgatcaag tagatttttt aaaaatcaga 60 

<210> 163 
<211> 1722 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM 004988 



<400> 163 

cgtagagttc 

acaggccaac 

ctgcctgtgg 

agtcatcatg 

ccaacaagag 

ggtcctgggc 

tcagggagcc 

gggttccagc 

ccgagcagta 

agccagggag 



ggccgaagga 
o cagaggaca 
g"tcttcattg 
tctcttgagc 
gccctgggcc 
accctggagg 
tccgcctttc 
agccgtgaag 
atcactaaga 
ccagtcacaa 



acctgaccca 
ggattccctg 
cccagctcct 
agaggagtct 
tggtgtgtgt 
aggtgcccac 
ccactaccat 
aggaggggcc 
aggtggctga 
aggcagaaat 



ggctctgtga 
gaggccacag 
gcccacactc 
gcactgcaag 
gcaggctgcc 
tgctgggtca 
caacttcact 
aagcacctct 
tttggttggt 
gctggagagt 



ggaggcaag-g 
aggagcacca 
ctgcctgctg- 
cctgaggaa_g- 
gcctcctcc t 
acagatcctc 
cgacagaggjc 
tgtatcctg~g 
tttctgctcc 
gtcatcaaaa 



ttttcagggg 
ag : gagaagat 
ccctgacgag 
cccttgaggc 
cctctcctct 
cccagagtcc 
aacccagtga 
agtccttgtt 
tcaaatatcg 
attacaagca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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ctgttttcct 
cgtgaaggaa 
ctatgatggc 
cctggtcatg 
gagtgtgatg 
gctcacccaa 
tcccgcacgc 
agtccttgag 
tgaagcagct 
tgggaggggg 
gtgtgacatg 
ttctgttcta 



cjagatcttcg 
gcagacccca 
ctgctgggtg 
attgcaatgg 
cjaggtgtatg 
cjatttggtgc 
tatgagttcc 
tatgtgatca 
t tgagagagg 
actgggccag 
aggcccattc 
fctgggtgact 



gcaaagcctc 
ccggccactc 
ataatcagat 
agggcggcca 
atgggaggga 
aggaaaagta 
tgtggggtcc 
aggtcagtgc 
aggaagaggg 
tgcaccttcc 
ttcactctga 
tggagattta 



tgagtccttg 
ctatgtcctt 
catgcccaag 
tgctcctgag 
gcacagtgcc 
cctggagtac 
aagggccctt 
aagagttcgc 
agtctgagca 
agggccgcgt 
agagagcggt 
tctttgttct 



cagctgg-tct 
gtcacctgcc 
acaggct tec 
gaggaaa tct 
tatgggg-cLgc 
eggcagg-tge 
gctgaaacca 
tttttct tec 
tgagttg-cag 
ccagcag-ctt 
cagtgtt etc 
cttttgg-oiat 



ttggcattga 
taggtctctc 
tgataattgt 
gggaggagct 
ccaggaagct 
cggacagtga 
gctatgtgaa 
catccctgcg 
ccagggccag 
cccctgcctc 
agtagtaggt 
tgttcaaatg 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 



ttttttttta 
acacagttct 
tccattctat 
aaaatgagca 
tgecttatae 
ttggcttctt 
aaaaaaaaaa 



etgggatggtt 
cjtgtatatag 
fcttgtgaatt 
gtaaaataga 
ctcagtctat 
tzgagaatgta 
aaaaaaaaaa 



gaatgaactt 
tttaagggta 
gggataataa 
tgagataaag 
tctgtaaaat 
agagaaatta 
aaaaaaaaaa 



cagcatccaa 
agagtcttgt 
cagcagtgga 
aactaaagaa 
ttttaaagat 
aatctgaata 
aaaaaaaaaa 



gtttatg-eiat 
gttttat tea 
ataagtactt 
attaagagat 
atatgeatae 
aagaatt ctt 
aa 172 2 



gacagcagtc 
gattgggaaa 
agaaatgtga 
agtcaattct 
ctggatttcc 
cctgttaaaa 



1380 
1440 
1500 
1560 
1620 
1680 



<210> 164 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004L9 88 



<400> 164 

cagattggga aatccattct attttgtgaa ttgggataat aacagcagtg gaataagtac 



60 



<210> 165 
<211> 2334 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_004=994 



<400> 165 

agacacctct 

gggctgctgc 

cctgagaacc 

cactegggtg 

ccagaagcaa 

gcgaacccca 

caagtggcac 

ggcggtgatt 

caccttcact 

geaeggagae 

tggccccggc 

gggcgtcgtg 

catcttcgag 

ctggtgcagt 

gagactctac 

ccaaggccaa 

cgccaccacc 

etcgaeggtg 

gggtaaggag 

taccacctcg 



gccctcacca 
tttgctgccc 
aatctcaccg 
ejeagagatge 
ctgtccctgc 
oggtgcgggg 
caccacaaca 
gacgacgcct 
cgcgtgtaca 
cjg-gtatccct 
a_ttcagggag 
gttccaactc 
ggccgctcct 
accacggcca 
acccgggacg 
tcctactccg 
gccaactacg 
a-tggggggca 
tactcgacct 
aactttgaca 



tgagcctctg 
ccagacagcg 
acaggcagct 
gtggagagtc 
ccgagaccgg 
tcccagacct 
tcacctattg 
ttgcccgcgc 
gccgggacgc 
tegaegggaa 
acgcccattt 
ggtttggaaa 
actctgcctg 
actacgacac 
geaatgetga 
cctgcaccac 
acegggacaa 
acteggeggg 
gtaccagega 
gcgacaagaa 



gcagcccctg 
ccagtccacc 
ggcagaggaa 
gaaatctctg 
tgagctggat 
gggcagattc 
gatccaaaac 
cttcgcactg 
agacategtc 
ggaegggetc 
cgacgatgac 
cgcagatggc 
caccaccgac 
cgacgaccgg 
tgggaaaccc 
ggaeggtege 
gctcttcggc 
ggagctgtgc 
gggccgcgga 
gtggggcttc 



gtcctggtgc 
cttgtgc tct 
tacctgta.ee 
gggcctgcgc 
agcgccacgc 
caaacct ttg 
tacteggaag 
tggagegogg 
atccagt ttg 
ctggcacacg 
gagttgtg-gt 
gcggcctgcc 
ggtcgctccg 
tttggct tct 
tgccagtttc 
tccgacggct 
ttctgcccga 
gtcttccoct 
gatgggeg-cc 
tgcccggacc 



tcctggtgct 
tccctggaga 
gctatggtta 
tgetgettet 
tgaaggecat 
agggegaect 
acttgccgcg 
tgacgccgct 
gtgtcgcgga 
cctttcctcc 
ccctgggcaa 
acttcccctt 
acggcttgcc 
gccccagcga 
cattcatctt 
accgctggtg 
cccgagctga 
tcactttcct 
tctggtgcgc 
aaggatacag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 
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tttgttccto gtggcggcgc atgagttcgg ccacgcgctg ggcttacjatc attcctcagt 12 60 

gccggaggccj ctcatgtacc ctatgtaccg cttcactgag gggcccccct tgcataagga 132 0 

cgacgtgaat ggcatccggc acctctatgg tcctcgccct gaacctgagc cacggcctcc 13 80 

aaccaccacc acaccgcagc ccacggctcc cccgacggtc tgccccaccg gaccccccac 1440 

tgtccacccc tcagagcgcc ccacagctgg ccccacaggt cccccctcag ctggccccac 15 00 

aggtcccccc actgctggcc cttctacggc cactactgtg cctttgagtc cggtggacga 1560 

tgcctgcaac gtgaacatct tcgacgccat cgcggagatt gggaaccagc tgtatttgtt 162 0 

caaggatggg- aagtactggc gattctctga gggcaggggg agccggccgc agggcccctt 1680 

ccttatcgcc gacaagtggc ccgcgctgcc ccgcaagctg gactcgcjtct ttgaggagcc 1740 

gctctccaag aagcttttct tcttctctgg gcgccaggtg tgggtgfcaca caggcgcgtc 1800 

ggtgctgggc ccgaggcgtc tggacaagct gggcctggga gccgaccjtgg cccaggtgac 1860 

cggggccctc cggagtggca gggggaagat gctgctgttc agcgggcggc gcctctggag 192 0 

gttcgacgtcj aaggcgcaga tggtggatcc ccggagcgcc agcgaggtgg accggatgtt 1980 

ccccggggtcr cctttggaca cgcacgacgt cttccagtac cgagagaaag cctatttctg 2 040 

ccaggaccgc ttctactggc gcgtgagttc ccggagtgag ttgaaccagg tggaccaagt 2100 

gggctacgtcr acctatgaca tcctgcagtg ccctgaggac tagggctccc gtcctgcttt 2160 

gcagtgccat gtaaatcccc actgggacca accctgggga aggagccagt ttgccggata 2220 

caaactggta ttctgttctg gaggaaaggg aggagtggag gtgggctggg ccctctcttc 2280 

tcacctttgt tttttgttgg agtgtttcta ataaacttgg attctctaac cttt 2334 

<210> 166 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0 04994 
<400> 166 

ggccctctct tctcaccttt gttttttgtt ggagtgtttc taataaactt ggattctcta 60 

<210> 167 
<211> 5329 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> Modif ied_base 
<222> 1 . . . 5329 
<223> n = SL,c,g, or t 



<300> 

<308> NM_0O5063 



<400> 167 

gtggtgtcgg 

ctgttctcct 

tcgcactttg 

cacccgttgc 

cgccgacaac 

accgcagtcc 

cccggcatcg 

ataccaccac 

agttggagac 

tatatgaccc 

gaaacatcat 

ctacctgcaa 

gcataacagc 

tacggctctt 

ctcgtgacca 



tgtcggcagc 
ccccctcccg 
cccctgcttg 
cagctctagc 
cagctagcgt 
tccggcgacc 
gagagccaag 
caccaccatt 
gatgcccctc 
cacctacaag 
ccttatgtct 
gttctacacc 
engage teat 
tctgatcatt 
ccgtgcccac 



atccccggcg 
cccttacctc 
geageggata 
ctttaaattc 
gcaaggcgcc 
ccgaactccg 
atgccggccc 
acagcgcctc 
tacttggaag 
gataaggaag 
ctgctacact 
tggctttggg 
cgtctgtgga 
gccaacacaa 
cacaagtttt 



ccctgctgcg 
cacgcgggac 
aaagggggct 
ccggctcggg 
gcggctcagc 
ctccggagcc 
acttgetgea 
cctccagggt 
acgacattcg 
gcccaagccc 
tgggagcect 
gggtattcta 
gccaccgctc 
tggcattcca 
cagaaacaca 



gtcgccgcjag 

cgcccgcgcc 

gaggaaatac 

acctccacgc 

gcgtacccjgc 

tcagccccct 

ggacgatatc 

ectgeagaat 

ccctgatata 

caaggttcjaa 

gtatgggatc 

ctattttcftc 

ttacaaagjet 

gaatgatgrtc 

tgetgatect 



ccctcggcct 
agtcaactcc 
cggacacgtc 
acegggctag 
gggcttcgaa 
ggaaagtgat 
tctagctcct 
ggaggagata 
aaagatgata 
tatgtctgga 
actttgattc 
agtgccctgg 
cggctgcccc 
tatgaatggg 
cataattccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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gacgtggctt 

aagagaaggg 

agaggaggta 

cctggtattt 

atgctgtggt 

gtccttatga 

gtgagggctt 

gctggcacat 

atgaccggaa 

gaaactacaa 

gcagaggttt 

atgatgttaa 

aactctgcct 

ggcccattgt 

gcagctggtc 

ctaaaacgaa 

agtaggtggc 

ttcagcccag 

aaagcnaggt 

cgaaggaacc 

ccaccacata 

ggcaatgcta 

taagtttgta 

taacaaggag 

tggtaaaaac 

tcctctttct 

ctgtggcatt 

aaataaaata 

ttccaaagag 

atttcctctc 

ttctaagagc 

tactgagtga 

tccagtttct 

ttcactcatt 

aactgcctgg 

taccctgtct 

ccctggacag 

agctccctcc 

agagctagct 

cctgccacac 

agaaagcctc 

ggccagccac 

agggcatgct 

cagctggcag 

gggcaagaac 

ctgcaaagct 

tctgttcagc 

tggaaaaaaa 

catcaaagca 

cagtccactg 

gtagaggcta 

agtagaaaag 

cccttggaaa 

tatataaaca 

atctgtacca 

gtgtggctgt 

tctaaagctc 

tttctaacag 

tttctaatca 

cctgagaccc 

cacgatttgc 



tttcttctct 
gagtacgcta 
ctacaaacct 
ctggggtgaa 
gcttaatgcc 
caagaacatt 
ccacaactac 
caacttcacc 
gaaagtctcc 
gagtggctga 
taatgtctgt 
cccattccag 
ttatgatgct 
cctccttttc 
agtctttgct 
tggtctttgc 
ttgagctaga 
gcttttgcta 
aaattgtcgg 
tctcgccatg 
gcatgcttcc 
attcaatgcc 
gtaaaagtgg 
atttcttagt 
agcagctcat 
gctctgccat 
tccaaacatc 
tatatacata 
ggatgtttgg 
cactgctgga 
gaacttcaca 
gttattggga 
ccttgaaatg 
tttttttgag 

ggggcagggt 

gtcccttttc 
gcagcctcct 
tgcacacaga 
gccactttca 
tgatggctca 
tctctccctc 
agagcaaaag 
gaatgccccc 
tgcaatgctt 
ccaagtgcct 
ttggctgaga 
aaactaacca 
caaaaacaga 
gaagtctggc 
ttgagcccca 
caggggttag 
gatctaaggg 
tgtctgctgg 
ataaaaacaa 
tgaccctaca 
gggtgtgggt 
aattcaagtg 
tccttgcttt 
cagtgttgca 
tactgaagtg 
aggattccct 



cacgtgggtt 

gacttgtctg 

ggcttgctgc 

acttttcaaa 

acctggctgg 

agcccccggg 

caccactcct 

acattcttca 

aaggccgcca 

gtttggggtc 

ttattaacta 

tacagtattc 

aagctgatat 

actttaatcg 

cagtgtccag 

tccagataac 

gataaaacag 

gatggaatgg 

gggagagagt 

atcagacata 

tttctctcct 

gcaacatata 

tctctgctgg 

tcatatatca 

agaattttga 

cttcaggata 

caaaaaaagg 

tatacattgc 

aaaaaactct 

catgagatgg 

taggaaggga 

gtcttaataa 

agtaaaaact 

gagaaggggg 

taggaatctc 

tttgaccaga 

ttgtgtgtat 

atgctcaggg 

cgtggcctcc 

agacaaggct 

cctctctcat 

agggtttatt 

tgcttacttg 

gtagaagtag 

cacctcgaaa 

cctgggattt 

gcattcccta 

atttgagaac 

tttgctctat 

gtgctggaag 

cctggactaa 

aagatcactg 

tatttctaat 

ctttcacttc 

taaggctgga 

gggagtgtgt 

acacattaat 

ggggggtgtg 

tatgagcctg 
gctggtagaa 
tctgggcttc 



ggctgcttgt 
acctagaagc 
tgatgtgctt 
acagtgtgtt 
tgaacagtgc 
agaatatcct 
ttccctatga 
ttgattgcat 
tcttggccag 
cctcaggttc 
ctgaataatg 
ttttaaaatt 
tatttcttct 
ccctcctttc 
cttccaaagc 
tctctttcct 
aatcttctgg 
aaaagcaact 
tagcatgtat 
cagctgccta 
ggctcggggt 
gttgaggccg 
ggaagggttt 
agaagtcttg 
gtattccatg 
ttggttcttc 
gaaggattta 
ttagaacgtt 
gaaggagagg 
agaggctgag 
tctgagaaca 
actagatatt 
agaaggcttc 
tctctgttaa 
ttcactaccc 
tctttctctt 
tcagaggcag 
tcactgaacc 
gcagtgtctc 
ggcaaaccct 
gagaagccaa 
ttcagtcccc 
gtgagggtgc 
gaggaaacag 
ggaggccctg 
gagataccac 
cagcctaggg 
cttggaccac 
taagattgga 
ggaggaaggc 
aggcatcctt 
tagtttagtt 
tccacaggtc 
ttcctattgt 
tggcacctca 
ctgctgagta 
gataaactca 
ctggcaactt 
ccctcactcc 
aaaggggcct 
attctggaaa 



gcgcaaacac 

tgagaaactg 

catcctgccc 

cgttgccact 

tgcccacctc 

ggtttcstctt 

ctactctgcc 

ggccgccctc 

gattaaaaga 

ctttttcaaa 

ctaccaggat 

caaaagtatt 

cttatcctct 

ccttattgcc 

ctagacaacc 

tgagctg-ttg 

gtagtcccct 

tcatttgaca 

gaatgtaagg 

cctaatgagg 

aaaaagfcggc 

aggataaaga 

tcttttcttt 

aagttgg-gtg 

agctgctcat 

ccctcatagt 

aggaggtzgaa 

aaactattag 

aggaattagt 

ggacaggatc 

cgttcag-ggg 

aggtccattc 

tctccacagt 

catctagcct 

tgattcttga 

ccctgaacgt 

tgatgacttg 

actgcttctc 

cacctacacc 

cccagaaaca 

gcgctca-tgt 

tctctctggg 

cccgcctgag 

ttctcactgg 

ttccctg-gag 

aaaccctgct 

cagacaatag 

tcctgtccct 

aatgtacact 

ctttctt ctg 

gtctttg-agc 

ctgttga_cct 

atcagatgcc 

aatcgtg-tgc 

ggctgag-g-gc 

aggaacacga 

gatctgatca 

agctcag-^tg 

ctctgcagaa 

gagtgga-gga 

cttttgt tag 



ccagctgtca 

gtgatgttcc 

acgcttgtgc 

ttcttgcgat 

ttcggatatc 

ggagctgtgg 

agtgagtacc 

ggtctggcct 

accggagatg 

aaccagccag 

gctaaagatg 

gaaagccaac 

ctctcttcta 

tcccaggcaa 

tttctgtagc 

tgagctttga 

gttgattatc 

caaagcttct 

atgagggaag 

acttcaagcc 

tgcggtgttt 

aaagacattt 

ttttctttaa 

tttccagaat 

tacagttctt 

aataagatgg 

gtcgggtcaa 

agtatttccc 

tgggatgcca 

tataggcagc 

ttgagaaggt 

attaattagt 

gttgtgcccc 

aaagtataca 

ttcctggctc 

tttcttcttt 

ctgtccaggc 

ttttgaaagt 

cctgtgctcc 

tctctggccc 

tgagccagtg 

tcagaaccag 

tcagtgctct 

gaagaagcaa 

tcagggtgaa 

gaacacagtg 

tatagaagtc 

gtagctcagt 

accaaacact 

tgttaattgc 

tattcacctc 

gtgcacctac 

tgcttgataa 

catggatctg 

cccaatgtat 

ttttcaagat 

agagtccgga 

ccttacatct 

tccctttgca 

ttatcagtat 

ggctgctttt 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 
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cttaagtgcc 
tttttttttt 
gtttgctacg-t 
gt taaaaaag 
cactgact tg 
tctgttaa<a.a 
aatgcttt tt 
tttttatasit 
ttagtctata 
gccagacaaa 
cttttctg-fcg 
acaagaataa 
tgagattgtg 



cacatttgat 
tttgggttaa 
atcttgggtg 
cgaggtggcc 
ctgtgtgacc 
tggggataat 
agaaagcatt 
aggaatgtcc 
tggttctcca 
atttgagaat 
attgggtggg 
ttttggaata 
tttgttcata 



ggagggtgga 
aagatggttg 
tattctctgt 
atgttatgct 
ctgggcaagt 
aatactgacc 
ttgggatcct 
accatgaact 
agaaactgaa 
acataaacaa 
attttttccc 
atttctatta 
ataaaagtga 



aataatttga 
tagcatttaa 
aagtgtagct 
ggtggttgcc 
cacttaacta 
tacctcaaag 
tcagcacagg 
tgatacgtcc 
tgaatccatt 
cgcattgcca 
tttttatgtg 
atatcaactc 
agtgaatctg 



atgtatttga 
aatggaaaat 
caaataggtc 
agggcctcca 
taaggtgcct 
ggcagttt tg 
aattctcaag 
gtgtgtccca 
ggagaagc gg 
cggaaacata 
ggatatag-ta 
tgaagctaat 
attgcactg 



tttataagtt 
tttctccttg 
atcatgaaag 
accactgtgc 
cagttttcct 
aggcatgact 
acctgagtat 
gatgctgtca 
tggataacta 
cagaggatgc 
gttacttgtg 
tgtactaatc 
5329 



4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 



<210> 168 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005063 
<400> 168 

aataatgcta ccaggatgct aaagatgatg atgttaaccc attccagtac agtattcttt 6 0 

<210> 169 
<211> 634 
<212> DNA 
<213> Homo sapiens 



<300> 
<308> NM_ 



.005101 



<400> 169 

cggctgagsig 

ccacagccca 

caggtgtccc 

attggcgtg-c 

gacagggtcc 

gacaaatgcg 

tacgaggtcc 

ggtgtgcag-g 

ccgctgggcg-g 

ggaggcggca 

caagggccg-g 



gcagcgaact 
cagccatggg 
tgagcagctc 
acgccttcca 
cccttgccag 
acgaacctct 
ggctgacgca 
acgacctgtt 
agtacggcct 
cagagcctgg 
aaataaaggc 



catctttgcc 
ctgggacctg 
catgtcggtg 
gcagcgtctg 
ccagggcctg 
gagcatcctg 
gaccgtggcc 
ctggctgacc 
caagcccctg 
cgggcggagc 
tgttgtaaga 



agtacaggag 
acggtgaaga 
tcagagctga 
gctgtccacc 
ggccctggca 
gtgaggaata 
cacctgaagc 
ttcgagggga 
agcaccgtgt 
taagggcctc 
gaat 634 



cttgtgcccjt 
tgctggcg^g 
aggcgcagat 
cgagcggtg-t 
gcacggtcct 
acaagggc eg 
agcaagtgag 
agcccctgga 
tcatgaatct 
caccagcatc 



ggcccacagc 
caacgaattc 
cacccagaag 
ggcgctgcag 
gctggtggtg 
cagcagcacc 
egggctggag 
ggaccagctc 
gcgcctgcgg 
cgagcaggat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 



<210> 170 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005101 
<400> 170 

tggtggtgg-a caaatgegae gaacctctga gcatcctggt gaggaataac aagggecgea 60 

<210> 171 

<211> 1339 

<212> DNA. 

<213> Homo sapiens 
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<30O> 

<308> NM 005139 



<40O> 171 

gaattccgat 

ggttggacac 

agctattcag 

gactgagagg 

aaaggagctg 

ggccctagtg 

cgcgggaaca 

gaaggatatc 

ttccgaaaca 

tgaatagtctg 

tggtgagaac 

ctttcctcaa 

ggacagcata 

ttgtgtgagg 

tggaactgat 

ggacattcga 

ggatacttct 

caagaagata 

ctcatactat 

ttcattgttc 

atgtagcagc 

ttatatttct 

agataataaa 



tagtgtgatc 
cgaggaacag 
aaagcaatca 
tcaaatgcac 
aaagatgact 
actccaccag 
aacgaagatg 
tctcaagcct 
tctggtgact 
aaagtggatg 
agatggggca 
ttaaaactaa 
aaaggagaat 
aacacgccgg 
gagtttactc 
acagagttca 
ggagactatg 
atctccaaag 
ttaagagaac 
tataacaaca 
tcataaatga 
gctttcaaag 
acggaattc 



tcagctcaag 
taagagatta 
gaggaattgg 
agcggcagct 
tgaagggtga 
cagtctttga 
ccttgattga 
attatacagt 
tccggaaagc 
agcatctggc 
cggatgaaga 
catttgatga 
tatctgggca 
cctttttagc 
tgaaccgaat 
agaagcatta 
aaatcacact 
gtccacgatg 
aagcaaatat 
acaacaaaag 
aattgaaaat 
ttaagaatct 
1339 



gcaaaggtgg 
tccagacttt 
aactgatgag 
gattgttaag 
tctctctggc 
tgcaaagcag 
aatcttaact 
atacaagaag 
tctgttgact 
caaacaagat 
caaattcact 
atacagaaat 
ttttgaagac 
cgaaagactg 
aatggtgtcc 
tggctattcc 
cttaaaaatc 
ggctttccca 
aaacagcaac 
cgattattat 
ggtattaaag 
ttatagttct 



gatatcatgg 
agcc catcag 
aaaatgctca 
gaatatcaag 
cact ttgagc 
ctaaagaaat 
accaggacaa 
agtc ttggag 
ttgg-cagatg 
gcccagattc 
gagatcctgt 
atcagccaaa 
ttac tgttgg 
catcgagcct 
agat cagaaa 
ctatattcag 
tgtg-g-tggag 
acagctccac 
ttgtgttcct 
tttagagcat 
atctgcaact 
actccattaa 



catctatctg 
tggatgctga 
tcagcattct 
cagcatatgg 
atctcatggt 
ccatgaaggg 
gcaggcaaat 
atgacattag 
gcagaagaga 
tctataaagc 
gtttaaggag 
aggacattgt 
ccatagttaa 
tgaagggtat 
ttgacctttt 
caattaaatc 
atgactgaac 
cttacttctt 
aacaggaatt 
ctcatttata 
actatccaac 
atataaagca 



<210> 172 

<2i:L> 60 

<212> DNA 

<2 13 > Homo sapiens 

<30O> 

<308> NM_005139 
<40O> 172 

ttcagcaatt aaatcggata cttctggaga ctatgaaatc acactcttaa aaatctgtgg 

<210> 173 

<21IL> 1582 

<212> DNA 

<213> Homo sapiens 

<30O> 

<308> NM 005165 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 



60 



<40O> 173 

ccgagctgtg 

ctcactcgta 

ggattgtagc 

agccjgctgag 

tcctgttcag 

agaccctcta 

gcatcgtcgt 

aaaccaccac 

gtgctgactt 

ttgccattct 

gcattgtgcc 

gtcsLgtatgt 



cttgtggctg 
cccagccctt 
cccgggcaaa 
ccaaattggg 
tgctgatgac 
ccagaaagat 
gggcatcaag 
tcaagggctg 
tgccaagtgg 
ggagaacgcc 
tattgtggaa 
tacagagaag 



cggctgctaa 
tctgctgagc 
ggcattctgg 
gtggaaaaca 
cgtgtgaaaa 
gataatggtg 
gttgacaagg 
gatgggctct 
cgctgtgtgc 
aacgtgctgg 
cctgaaatat 
gtcttggctg 



ctggctgcgc 
agaagaagga 
ctgcggatga 
cagaggagaa 
agtgcattgg 
ttcccttcgt 
gtgtggtgcc 
cagaacgctg 
tgaaaatcag 
cccgttatgc 
tgcctgatgg 
ctgtgtacaa 



acagggagct 
gttg"tctgac 
gtct^taggc 
ccgccggctg 
aggcgtcatt 
ccgaaccatc 
tctag-ctggg 
tgcccaatac 
tgagcgtaca 
cagtatctgc 
agacoacgac 
ggccctgagt 



gtcaccatgc 
attgccctgc 
agcatggcca 
taccgccagg 
ttcttccatg 
caggataagg 
actgatggag 
aagaaggatg 
ccctctgcac 
cagcagaatg 
ctcaaacgtt 
gaccatcatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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tatacctgga 
tcaagtatac 
ccccagctgt 
tcaacctcaa 
atgggcgtgc 
gggctgccac 
agtatgaagg 
atgcctactg 
tgcttcg-tagt 
ccaacttgtc 
ggatg^gagg 
caggatggct 
gcttctccct 
aagccctgac 
aaaaaaaaaa 



ggggaccctg 
cccagaggag 
cccaggagtg 
tgccatcaac 
cctgcaagcc 
tgaggagttc 
cagtggagaa 
agtatccact 
catggccagg 
ttcctttctc 
atagggagcc 
gggtctcccc 
gggctctcct 
tcatgccttg 
aaaaaaaaaa 



ctcaagccca 
attgccatgg 
accttcctgt 
cgctgccccc 
tctgcactca 
atcaagcggg 
gatggtggag 
ccataccaca 
gccaaatagc 
tcttcccttc 
cctcatgact 
ctacctcttc 
tcttgcctgc 
agtacatacc 
aa 1582 



acatggtgac 
caactgtcac 
ctgggggtca 
ttccccgacc 
atgcctggcg 
ctgaggtgaa 
cagcagcaca 
gcccttggcc 
tatgcagagc 
ccctctctca 
gagggcagaa 
cagctcccac 
cctgtctcct 
atacagcaaa 



cccgggccat 
tgccctgcgt 
gagcgaagaa 
ctgggcgctt 
agggcaacgg 
tgggcttgca 
gtcactctac 
cagccatctg 
cLgagatgcct 
ttgctgcacc 
gaaattgcta 
aattttccca 
cj-ggatcagag 
taaatggtag 



gcctgtccca 
cgcactgtgc 
gaggcatcat 
accttctcct 
gacaatgctg 
gcccagggca 
attgccaacc 
cacccacttt 
tcacctggca 
tgggaccata 
gaagtcagaa 
tgatgaggta 
ggtagtacag 
caaaacaaaa 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 



<210> 174 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005165 

<400> 174 

gagggtagta cagaagccct gactcatgcc ttgagtacat accatacagc aaataaatgg 6 0 

<210> 175 

<211> 451 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005213 



<400> 175 

acttccctgt 

atacctggag 

aaggttaaac 

cagtataaaa 

aataaatata 

cttactggat 

gtacccaaag 

ataaccatca 



tcactttggt 
gcttatctga 
cacagcttga 
ctcaagttgt 
tgcacttgaa 
accaggttga 
tgttctgatt 
ataaagaagc 



tccagcatcc 
ggccaaaccc 
agaaaaaaca 
tgctggaaca 
agtattcaaa 
caaaaacaag 
ccttcaactg 
attcttttcc 



tgtccagcaa 
gccactccag 
aatgagactt 
aattactaca 
agtcttcccg 
gatgacgagc 
gctactgagt 
a 451 



agaagcaatc 
aaatccagga 
atggaaaatt 
ttaaggtacg 
cjacaaaatga 
tgacgggctt 
catgatcctt 



agccaaaatg 
gattgttgat 
ggaagctgtg 
agcaggtgat 
ggacttggta 
ttagcagcat 
gctgataaat 



60 

120 

180 

240 

300 

360 

420 



<210> 176 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM__005213 

<400> 176 

aactggctac tgagtcatga tccttgctga taaatataac catcaataaa gaagcattct 60 



<210> 177 

<211> 366 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> NM_005218 



<400> 177 

gtcagctcag cctccaaagg agccagcctc 
tgccagtcgc catgagaact tcctaccttc 
agatggcctc aggtggtaac tttctcacag 
gcgtcagcag tggagggcaa tgtctctatt 
gcacctgtta cagagggaag gccaagtgct 
gacgcagaag tgaaatgaac ttfcttataag 
aagtat 3 66 



tccccagttc 
tgctgtttac 
gccttggcca 
ctgcctgccc 
gcaagtg-agc 
cattctttta 



ctgaaatcct 
tctctgctta 
cagatctgat 
gatctttacc 
tgggagtgac 
ataaaggaaa 



gagtgttgcc 60 

cttttgtctg 120 

cattacaatt 180 

aaaattcaag 240 

cagaagaaat 3 00 

attgcttttg 3 60 



<210> 178 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005218 



<400> 178 

gggagtgacc agaagaaatg acgcagaagt gaaatgaact ttttataagc attcttttaa 60 



<210> 179 

<211> 1519 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_005326 



<400> 179 

ctgcctcgga 

cccgggcggc 

aggctcgcgg 

gtctcccggc 

attggttggc 

ctgattggct 

gcccggcccg 

tgggagccgc 

cagatttgcg 

ccctgaccga 

tggatccggt 

ccacagtgct 

agctggagtc 

agatcactca 

cgtgccacac 

ctgccgtgtt 

ctgcggatga 

gagtctactg 

ccggcaatgc 

agcccacagt 

gggagaagac 

tgcgcaggga 

cggatttggg 

tcagtcttga 

catatttata 

aaaaaaaaaa 



acgctgtccc 
gtggcgctcg 
cacagctctc 
ccgtggcagt 
ggacggtgcg 
gcgacgcgcc 
ggtcatggtg 
ctgcgcccgc 
gaagaacctg 
caactacatg 
gcagccccag 
caccacccac 
gggactgaag 
cctgtccaca 
ttcaggacac 
cacaggtgac 
gatgtgtaaa 
tggccacgag 
cgccatccgg 
gccatccacc 
ggtgcagcag 
gaaggaccag 
gattaggctc 
tttaacctta 
agagaagttt 
aaaaaaaaa 



ccgcagcgac 
gcctccaggg 
ccggcgcagc 
cggggctcgc 
aggtggacgc 
gacgccggtg 
gtgggccgag 
cgaggcctcg 
accgtggacg 
tacctggtca 
aaggtcgtgg 
caccactggg 
gtgtacgggg 
ctgcaggtgg 
atttgttact 
accttgtttg 
gctctgctgg 
tacaccatca 
gagaagctgg 
ctggcagagg 
cacgcaggtg 
ttcaagatgc 
ttttaggtaa 
attttacagc 
aacaagtatt 
1519 



ggcccgfctcc 
atccactgtg 
tctcggcjccg 
ggacaaaaca 
tgattgg-ctg 
ttttgcagtc 
ggctgctcgg 
gtccagccct 
agggcaccat 
ttgatgeitga 
acgcggcgag 
accatgctgg 
gtgacga.ccg 
ggtctctgaa 
tcgtgaojcaa 
tggctg^ctg 
aggtcttggg 
acaacctcaa 
cctgggccaa 
agtttaccta 
agacggaccc 
cccggga.ctg 
ctggctttcc 
ccttggcttg 
tattcccata 



acctcgcgat 
cggtgccaaa 
ccgccgccgc 
agttgagcgc 
agggcagcgc 
ctgggcagct 
ccgccgcagc 
gctgggagtt 
gaaggtagag 
gaccaaggag 
aaagcacggg 
cgggaatgag 
tatcggggcc 
cgtcaagtgc 
gcccggaggc 
cgggaagttc 
ccggctcccc 
gtttgcacgc 
ggagaagtac 
caaccccttc 
ggtgaccacc 
aggccgccct 
tgctggtccg 
tgttatcgga 
aaaaaaaaaa 



ctgccgggta 
aaagaggcgg 
tcccaggccc 
gagcgcgttg 
gaggcgggcg 
cggcagtcca 
ctcgccgcgc 
ttctgccaca 
gtgctgcctg 
gctgccattg 
gtgaaactga 
aaactggtca 
ctgactcaca 
ctggcgaccc 
tcggagcccc 
tatgaaggga 
ccggacacaa 
cacgtggagc 
agcatcgggg 
atgagagtga 
atgcgggccg 
gcaccttcag 
tgcgggaaat 
cattctaatg 
aaaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 



<210> 180 
<211> 60 



81 



WO 2005/086891 



PCT/US2005/007894 



<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_0 0532 6 
<400> 180 

cttgtgttat cggacattct aatgcatatt tataagagaa gtttaacaag tatttattcc 60 

<210> 181 

<211> 3378 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005461 
<400> 181 

acagctgcac cgccgagctg cgagcggctg cgagcgagag agcgtaagag caagagagct 60 

agagagcgag caacgggcac tcgccccacg cctcccctca gccccaccgc gcgctccgct 12 O 

tgcctctcca ccccgcccga ctctacccgg cccggtccct gcgcgggcac agcccagagc 18 O 

tctggggcgg tgcaggcagc ctcgggactc tccggcgcgc cgccgcgtcc ccagacaaag 24 0 

gcttggccgg cggccccggc ccgctgcgcc ctcgctcccc gcctccccag ctcttctccg 30 O 

ctcttccccc ccgcgcttgg ctcggcgcgc tccggccggc cgcaaagttt cccgggcggc 36 0 

agcggcggct gcgcctcgct tcagcgatgg ccgcggagct gagcatgggg ccagagctgc 42 0 

ccaccagccc gctggccatg gagtatgtca acgacttcga cctgctcaag ttcgacgtga 48 0 

agaaggagcc actggggcgc gcggagcgtc cgggcaggcc ctgcacacgc ctgcagccag 54 0 

ccggctcggt gtcctccaca ccgctcagca ctccgtgtag ctccgtgccc tcgtcgccca 60 0 

gcttcagccc gaccgaacag aagacacacc tcgaggatct gtactggatg gcgagcaact 66 0 

accagcagat gaaccccgag gcgctcaacc tgacgcccga ggacgcggtg gaagcgctca 72 0 

tcggctcgca cccagtgcca cagccgctgc aaagcttcga cagctttcgc ggcgctcacc 78 0 

accaccacca tcaccaccac cctcacccgc accacgcgta cccgggcgcc ggcgtggccc 84 0 

acgacgagct gggcccgcac gctcacccgc accatcacca tcatcaccaa gcgtcgccgc 90 0 

cgccgtccag cgccgctagc ccggcgcaac agctgcccac tagccacccc gggcccgggc 96 0 

cgcacgcgac ggcctcggcg acggcggc^g gcggcaacgg cagcgtggag gaccgcttct 10 2 0 

ccgacgacca gctcgtgtcc atgtccg-tg-c gcgagctgaa ccgccacctg cggggcttca 10 80 

ccaaggacga ggtgatccgc ctgaagcaga agcggcggac cctgaagaac cggggctacg 1L4 0 

cccagtcttg caggtataaa cgcgtccagc agaagcacca cctggagaat gagaagacgc 12 0 0 

agctcattca gcaggtggag cagcttaagc aggaggtgtc ccggctggcc cgcgagagag 12 6 0 

acgcctacaa ggtcaagtgc gagaaactcg ccaactccgg cttcagggag gcgggctcca 13 2 0 

ccagcgacag cccctcctct cccgagttct ttctgtgagt cgtggccggt cctggccccc 13 80 

gcccttgccc cggcccggac tccctgtccc acgtccctag tcccagacta ccccggaccc 14=40 

tgtccctgcc gcggccccag ccttgacctg tttgacttga gcgagaggga ggaagggcgc 15 0 0 

gcgggccgcg ggcgacgggc gggtgcgcgg gcgggcaggg gaccttggct aaggcgagag 15 6 0 

tagcgcacgc cagcgccgcc tcctagactc gagcagagcc ggagagagag acgagagggt 16 2 0 

gggaggtccc ggagtaactt ctctccaggc tgaagggcgg cgaggcatag tcccgagaag IS 80 

tcaccaaggc catctggaga ctcctggctt tctgaacttt gcgcgttaag ccgggacagc 17 40 

tgctttgctg cccggagagt agtccgccfcc aggaagagag caacgaggaa aggagaggga IS 0 0 

ctctggcgtc ccggcaggcg agaggcga.gg ctgagcgaaa gaaggaagga cagacggacc IS 6 0 

tgtctgtcag agttcggaga acactggctc tcagccctga gacacaggcc tcagttagga 19 2 0 

cgctcggcgc ccaaatctca tcagttfcta.t tgcctgctcg attatataga aaaatacaaa 19 80 

aaatctgcat taaaaatatt aatcctgcat gctggacatg tatggtaata atttctattt 2D 40 

tgtaccattt tcttgtttaa ctttagca.tg ttgttgatca tggatcatac tccccttgtt 21-0 0 

tctttgggtg agaagggatc gcagtttg-ga aactccggcg gctgcgtgcg gggtttcagt 21-60 

cccagctgta ggcttgtaaa tacccgcccc gccaaaccgc atagagaacg tggcagcaag 22 2 0 

ctgagggtct ttgtttgggt ttattattac ggtatttttg tttgtaagtt aaaaagaaaa 22 80 

aaaaaaagaa aaagttccgg gcattttcjca tcagaaaaca actttgtctt ggggcacact 23 40 

tggaagttgc atgttttctt tccttccctt atccccattc ggtcctcttt ttcctctctc 24=00 

gctttagttt tcaaccttgt tggtgctg-a.g agagagaacc gagaggtccc agtacaaggg 24=60 

cagggcaggg cagggaagct gccaagctcc gcaccccaga ggagtgttct ggactacagc 25 2 0 

cttgtcttat ggtcaaattg atacccttaa taagaaagga aaggaaagga aaacagatcc 25 80 
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tcccctctgc 
ctgcgctaat 
tggcagtgca 
gaaacccaaa 
ttgcgtttct 
tcaattcttc 
tttttttttt 
aaagtgtatt 
agtttgtatt 
gcagttttaa 
ttcaactgtg 
agggatcaat 
tttattcaga 
aaaaaaaaaa 



tttttattgt 
tgtaggag-cc 
ctggtaagca 
cattggcaag 
ttccctgctt 
agttgtatag 
ctgcaggt ca 
tttaagttcc 
agttgataaa 
atattgtg-ga 
ttgaaatcaa 
ttcaaaccct 
ctggtttotg 
aaaaaaaa 3 



aaccagaatc 
acagcgctcc 
gacactgcac 
taattttgca 
ttgagatagt 
tagttatagg 
gtaaaaggat 
catttgaaat 
ttaacagtaa 
tatttgtcct 
acctgccgca 
gcttatgata 
ttttttggtt 
378 



accctgaggt 
tagggtgaga 
tgaaccaact 
actttcaagt 
aggaagagtt 
gaagatatgg 
ttaagttgca 
tgctggcgct 
taacaagatt 
gcatcagaaa 
acagaaattg 
tgaaaatatt 
attaaaatgg 



cccttctgaa 
ggcttagcca 
gctatgctca 
gcgttcttta 
cttggtggtg 
gtgtttttct 
ctgacaaaaa 
gctggccgga 
gtatgaaccg 
cgagctttgg 
tttttatttc 
aaaacctagt 
tttcctattt 



ccctc-bgggc 
tccctgaccc 
gaatgtacca 
gaccaatgca 
tcccccccct 
ttatt<eLttac 
taccaaaata 
tgcat "btttg 
catgg~tgctt 
ttttteLcaga 
atgtaaaata 
ctattcjtagt 
tgcttattaa 



2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 



<210> 182 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005461 



<400> 182 

atttgtcctg catcagaaac gagctttggt ttttacagat tcaactgtgt tgaaatcaaa 6 0 



<210> 183 
<211> 597 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_005532 



<400> 183 

agctgaagtt gaggatctct tactctctaa gccacggaat taacccgagc aggcatggag 60 

gcctctgctc tcacctcatc agcagtgacc agtgtggcca aagtggtcag ggtg-gcctct 12 0 

ggctctgccg tagttttg-cc cctggccagg attgctacag ttgtgattgg aggag-ttgtg 180 

gccatggcgg ctgtgcccat ggtgctcagt gccatgggct tcactgcggc gggaatcgcc 2 40 

tcgtcctcca tagcagccaa gatgatgtcc gcggcggcca ttgccaatgg gggtg-cjagtt 3 00 

gcctcgggca gccttgtg*g;g tactctgcag tcactgggag caactggact ctccg-cjattg 3 60 

accaagttca tcctgggctc cattgggtct gccattgcgg ctgtcattgc gaggt tctac 420 

tagctccctg cccctcgccc tgcagagaag agaaccatgc caggggagaa ggcacccagc 480 

catcctgacc cagcgaggag ccaactatcc caaatatacc tgggtgaaat ataccaaatt 540 

ctgcatctcc agaggaaaat aagaaataaa gatgaattgt tgcaactctt aaaaaaa 597 



<210> 184 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005532 



<400> 184 

agccaactat cccaaatata cc tgggtgaa atataccaaa ttctgcatct ccagaggaaa 60 



<210> 185 

<211> 1661 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> KTM_0055 66 
<400> 185 

tgctgcetgcc gctgccgccg attccggatc tcattgccac gcgcccccga cgaccgcccg 60 

acgtgcattc ccgattcctt ttggttccaa gtccaatatg gcaactctaa aggatcagct 12 0 

gatttataat cttctaaagg aagaacagac cccccagaat aagattacag ttgttggggt 180 

tggtgcfcgtt ggcatggcct gtgccatcag tatcttaatg aaggacttgg cagatgaact 240 

tgctcttgtt gatgtcatcg aagacaaatt gaagggagag atcjatggatc tccaacatgg 3 00 

cagccttttc cttagaacac caaagattgt ctctggcaaa gactataatg taactgcaaa 3 60 

ctccaagctg gtcattatca cggctggggc acgtcagcaa gagggagaaa gccgtcttaa 420 

tttggtccag cgtaacgtga acatatttaa attcatcatt cctaatgttg taaaatacag 480 

cccgaactgc aagttgctta ttgtttcaaa tccagtggat atcttgacct acgtggcttg 540 

gaagataagt ggttttccca aaaaccgtgt tattggaagt ggttgcaatc tggattcagc 6 00 

ccgattccgt tacctgatgg gggaaaggct gggagttcac ccattaagct gtcatgggtg 660 

ggtcctfcggg gaacatggag attccagtgt gcctgtatgg agtggaatga atgttgctgg 72 0 

tgtctctctg aagactctgc acccagattt agggactgat aaagataagg aacagtggaa 7 80 

agaggttcac aagcaggtgg ttgagagtgc ttatgaggtg atcaaactca aaggctacac 840 

atcctgg-gct attggactct ctgtagcaga tttggcagag agtataatga agaatcttag 900 

gcgggtgcac ccagtttcca ccatgattaa gggtctttac ggaataaagg atgatgtctt 9 60 

ccttagtgtt ccttgcattt tgggacagaa tggaatctca gaccttgtga aggtgactct 102 0 

gacttctgag gaagaggccc gtttgaagaa gagtgcagat acactttggg ggatccaaaa 1080 

ggagctgcaa ttttaaagtc ttctgatgtc atatcatttc actgtctagg ctacaacagg 1140 

attctag-gtg gaggttgtgc atgttgtcct ttttatctga tctgtgatta aagcagtaat 12 00 

attttaagat ggactgggaa aaacatcaac tcctgaagtt agaaataaga atggtttgta 12 6 0 

aaatccacag ctatatcctg atgctggatg gtattaatct tgtgtagtct tcaactggtt 132 0 

agtgtgaaat agttctgcca cctctgacgc accactgcca atgctgtacg tactgcattt 13 80 

gccccttgag ccaggtggat gtttaccgtg tgttatataa cttcctggct ccttcactga 1440 

acatgcctag tccaacattt tttcccagtg agtcacatcc tgggatccag tgtataaatc 1500 

caatatcatg tcttgtgcat aattcttcca aaggatctta ttttgtgaac tatatcagta 15 60 

gtgtacattta ccatataatg taaaaagatc tacatacaaa caatgcaacc aactatccaa 162 0 
gtgttatacc aactaaaacc cccaataaac cttgaacagt g 16 61 

<210> 186 

<211> GO 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> nNTM:_005566 
<400> 186 

catcaactcc tgaagttaga aataagaatg gtttgtaaaa tccacagcta tatcctgatg 60 

<210> 187 

<211> 2993 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> 3>3M_005689 
<400> 187 

gggcctg-cag ttggcagaag ggtcccgggc ccagagccag cg^ggccgtg ctgagacggc 60 

gtacgtgccc tgcgtgagtg cgtggcggcg gcgcgtgcgc ta^gggagtg ggcggtgagg 12 0 

cctggtccac gtgcgtccct tcccgggacc cccgcagctt ggcgcccagc ggctacgtga 180 

gccaaggcac ccggatgtcc gcgcccctct ccgagtgaca agtcccggcc tccggtcccg 240 

cagtgcccgc agcctcggcc ggcgtccacg cattgccatg gt^actgtgg gcaactactg 300 

cgaggccgaa gggcccgtgg gtccggcctg gatgcaggat ggcctgagtc cctgcttctt 3 60 

cttcacgctc gtgccctcga cgcggatggc tctagggact ct^gccttgg tgctggctct 42 0 
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tccctgcaga 
tcgcatctct 
ggccggcctg 
tctggcctcc 
gcggagccag 
tgg-tctcctg 
gaacagccca 
gtcg-ggtgctg 
acttcgtccc 
ccaggttcgg 
cctgagtggc 
cctggggctc 
cat tgtgaac 
ttacgtcttc 
cctgcgcacc 
catcttctcc 
ggtgctgcgg 
gttcaatgtc 
cttcaacgcc 
cattgtggtc 
tacccgggca 
cgagagttac 
gaagtcgagc 
gctcctcgcc 
ggactatgtg 
cacctactac 
gaaagaggag 
ccgtattgag 
cgtgtctttc 
gaagagcaca 
aatagatggg 
tgtgccccaa 
tgtcacagct 
cattatggct 
cgcjcggggag 
tctgctggat 
tcfcggccaaa 
ggtcaatgct 
cgaggctctg 
ggaagaaacc 
cttccctctc 
tcatcctggt 
catcttttgg 



cgccgggagc 
ccctacgtgc 
gctggccggg 
gtgctggaga 
gcacggcagc 
ctcctctgga 
cagtggtggt 
cggtatgtgg 
cagtcctata 
tcagcagccc 
tacctgtggc 
atgggtttgg 
ttgctgactg 
ctcaagttcc 
ttcctgtgga 
cacctgcacg 
atcgcggatc 
atccccacgc 
tggtttggcc 
actgagtgga 
cgagcagtgg 
gaagtggaac 
gcttcactgg 
ggctccctgc 
ctctttggca 
aggatgatcc 
acagaagtga 
tttgagaacg 
actgtgatgc 
attttgcgcc 
caggacattt 
gacactgtcc 
gggaatgatg 
ttccctgaag 
aagcagcgcg 
gaggcaacgt 
gtctgtgcca 
gaccagatcc 
ttgtcccgag 
tctgaagaca 
aaagactaac 
cttggggtat 
ggaaataaaa 



ggcccgctgg 
tgcagctgct 
tgggcactgc 
gtctggccgg 
gtctggcaat 
ctgtggcgtt 
gggcaagggc 
tctctggagg 
cattgcaggt 
aacagtctac 
ctcgagggag 
aacgggcact 
agaaggcacc 
tccagggggg 
tccgggtgca 
agctctcact 

ggggcacatc 

tggccgacat 
tcattgtgtt 
gaaccaagtt 
actctctgct 
gctatcgaga 
ttttactaaa 
tttgcgcata 
cctacattat 
agaccaactt 
aggaccttcc 
tgcacttcag 
ctggacagac 
tgctgtttcg 
cacaggtgac 
tctttaatga 
aggtggaggc 
ggtacaggac 
tcgccattgc 
cagcgctgga 
accgcaccac 
tcgtcatcaa 
gtggggtgta 
ctaagcctca 
ccagaaggga 
ggtgctagct 
atgtggactg 



tgctgattcg 
tctggccaca 
ccggggggcc 
cgcctgtggc 
gggcatctgg 
tgcagctgag 
agacttgggc 
gctgtttgtc 
tcatgaagag 
ctggcgagat 
tccagctctg 
caatgtgttg 
ttggaactct 
tggcactggc 
gcagttcacg 
gcgctggcac 
cagtgtcaca 
catcattggc 
cctgtgcatg 
tcgtcgtgct 
aaacttcgag 
ggccatcatc 
tcagacccag 
ctttgtcact 
ccagctgtac 
cattgacatg 
tggagcaggg 
ctatgccgat 
acttgccctg 
cttctacgac 
ccaggcctct 
caccatcgcc 
tgctgctcag 
acaggtgggc 
ccgcaccatc 
tacatctaat 
catcgtagtg 
ggatggctgc 
tgctgacatg 
gaccatggaa 
ataagatgtg 
atggtaaggg 
tgaaaaaaaa 



ctg-tcttggg 
cttcaggcgg 
ccactgccaa 
ctcjtggctgc 
atcaagttca 
aacttggccc 
caacaggttc 
ctcjggtctct 
gaccaagatg 
tttggcagga 
cacjctggtgg 
gtgcctatat 
ctcjgcctgga 
agtacaggct 
tctcggcggg 
ctcjgggcgcc 
ggg-ctgctca 
atcatctact 
agtctttacc 
atgaacacac 
acggtgaagt 
aaatatcagg 
aacctggtga 
gagcagaagc 
atgcccctca 
gagaacatgt 
ccccttcgct 
gggcgggaga 
gtgggcccat 
atcagctctg 
ctccggtctc 
gacaatatcc 
gctgcaggca 
gagcggggac 
ctcaaggctc 
gagagggcca 
gcacacaggc 
atcgtggaga 
tggcagctgc 
cggtgacaaa 
tctcctttcc 
aaagggacct 
aaaaaaaaaa 



gggccggccc 
cgctgcccct 
gctatctact 
ttgtcgtgga 
ggcacagccc 
tggtgtcttg 
agtttagcct 
gggcccctgg 
tggaaaggag 
agctccgcct 
tgctcatctg 
tctataggaa 
ctgttaccag 
tcgtgagcaa 
tggagctgct 
gcacagggga 
gctacctggt 
tcagcatgtt 
tcaccctgac 
aggagaacgc 
attacaacgc 
gtttggagtg 
ttgggctcgg 
tacaggttgg 
attggtttgg 
ttgacttgct 
ttcagaaggg 
ctctgcagga 
ctggggcagg 
gctgcatccg 
acattggagt 
gttacggccg 
tccatgatgc 
tgaagctgag 
cgggcatcat 
tccaggcttc 
tctcaactgt 

ggggacgaca 

agcagggaca 
agtttggcca 
ctggcttatt 
ttccgaaaaa 
aaa 2993 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 



<210> 188 

<211> 60 

<212> DNA 

<2 13> Homo sapiens 

<3O0> 

<308> NMJD05689 



<4O0> 188 

ggaaagggac ctttccgaaa aacatctttt ggggaaataa aaatgtggac tgtgaaaaaa 



60 



<210> 189 

<211> 1830 

<212> DNA 

<2 3_3> Homo sapiens 



<3O0> 
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<308:> NM_005749 



<400> 189 

ggggagttga 

taaattttat 

gtgaagaact 

catacaaagg 

aacaagcatc 

atcfc tagtgt 

tgaa^gtgct 

aaaacagctt 

tgtc cagctc 

cccggtccac 

ctac caaaat 

tcggcttgaa 

tgtatgggct 



aacctaattt 
tatttcgtat 
tgaaagactt 
atcggggttt 
caaagagagt 
ttggatcgac 
ttacgtggat 
taacccagag 
tccatcgcct 
tcagccttta 
gaagaatagt 
tgtgaatgac 
tggcttgggt 



tgtggcgtag 
ttgtacaata 
cttaagaaga 
agatgtatac 
ggtttggaca 
ccatttgagg 
gataataatg 
gcccaggttt 
ccttttggtc 
acctttacca 
ggccgtagca 
ctcttgaagc 
agccagcagc 



cagctatgca 
agcttcccag 
aatatgaagg 
acatagggga 
ttgatgatgt 
tttcttacca 
aaaatggatg 
ttatgcccat 
actctgctgc 
ctgccacttt 
acaaggttgc 
agaaagccat 
agccacagca 



gcttgaaatc 
gagacgtgtc 
gcactggtat 
gaaagtggac 
tcgtggcaat 
aattggtgaa 
tg-agttggat 
aagtgaccca 
tgtaagccct 
tgctgccacc 
acgtacttct 
ctcttcctca 
acagcagcag 



caagtagcac 
aacatttttg 
cctgaaaagc 
ccagtgattg 
ctgccacagg 
aagggaccag 
aaggagatca 
gcctcatcag 
accttcatgc 
aagttcggct 
cccatcaacc 
atgcactctc 
ccagcccagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 



cgccaccgcc 
ctaatgccaa 
tgtt cccagg 
tgtt tgcagc 
taaataacat 
aaaatgtatc 
tgaojaatttt 
taataaaaca 
gtatatttta 
catg-g-ctgat 
attgtataat 
tctacgacat 
tcatgttgta 
taaaaattgt 
tttcaaaaaa 
aaaatgtaat 
ataaaagtat 
gtttgtatta 



accaccacca 
ggaatttatt 
tgacagcccc 
ctatggaggc 
gcagtattct 
gtacaagtta 
tttttttaag 
tgcatcattt 
aagtatactc 
ttttttttat 
ttaatttagt 
ggtattgcat 
ttttttttac 
gtacaggata 
atgtattttc 
cgtaaatatt 
tttgagtcag 
aaattcttta 



ccacagcagc 
tttcctaata 
cttaacctca 
ctcaatgaga 
aaccagcaat 
aaatgcacgg 
cttatagtaa 
ttcatttgcc 
tcagatatgg 
aaaaattggc 
acaaagtttg 
ttatatcttt 
tgaaattgta 
ttcctttagt 
tgttaaaagt 
aattttgtac 
tgtcttacat 
aaattaaaaa 



aacaacagca 
tgcagggtca 
gtcctctcca 
agtcttttgt 
tccagcctgt 
gcccaagggg 
ggatacattc 
aaccaagcac 
cctcttacag 
actaataagt 
taaaatatca 
ttactacagt 
aaatatccat 
ggtggaatta 
ttaaagattt 
ctatattgtg 
gttaagaggg 
1830 



gaaaacctct 
aggtagtagt 
gtacagtaat 
agatggcttg 
tatggctaac 
gaitttttttt 
aagcttggtt 
aaagttattt 
tatttaagat 
gcg-gtttattg 
gsiggatatat 
gatctgtgac 
cttaaagaca 
aaatgtacga 
ttgctatata 
caatacttga 
actgaaatag 



gctctttctc 
accaatggaa 
gcctttgatg 
aattttagct 
taaaaaaaag 
ttcacctcct 
aaaaaaataa 
tatactgact 
atagcaagga 
gtcttttcta 
atatattgtt 
agcagcagct 
tcaactattc 
atacttgctt 
ttatggaaga 
aaaaaacggt 
tttatattaa 



840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 



<210> 190 

<213_> 60 

<212> DNA 

<213> Homo sapiens 

<30O> 

<308> NM_005749 
<40O> 190 

aaacctctgc tctttctcct aatgccaagg aatttatttt tcctaatatg cagggtcaag 60 

<210> 191 

<213_> 1534 

<212> DNA 

<213> Homo sapiens 

<30O> 

<308> NM„005804 



<40O> 191 

ggaa-gcgcag caactcgtgt ctgagcgccc 
agcagcgcgc ggagaagaac ggggagccag 
tcttttggat tacgatgaag aggaagagcc 



ggcggaaaac cg-aagttgga agtgtctctt 60 
catcatggca gaacaggatg tggaaaacga 12 0 
ccaggctcct caagagagca caccagctcc 180 
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ccctaagaaa 
tctgctgaag 
ggtccagcat 
gtccgg-gatg 
caacggacag 
caaggaatat 
tggtctctcc 
ggggaccccg 
gaagcacttt 
tgtgca.ggag 
cctgagcaag 
tgtggeicgac 
agacagtgag 
gataatcttc 
gaacttcccg 
tcagcagttc 
gatggacatc 
ctacctgcac 
tgtgtctgac 
tgtggcagaa 
accacgtgcc 
ggcgccactg 
aaaaaaaaaa 



gacatcaagg 
ccggagctcc 
gagtgcattc 
ggcaagacag 
gtgacggtcc 
gagcgctttt 
atcaagaagg 
ggccgcatcc 
gtgctggacg 
atcttccgcc 
gacatccggc 
gagaccaagc 
aagaaccgca 
gtcaagtcag 
gccatcgcca 
aaggatttcc 
gagcgagtca 
cgggtggccc 
gagaatgatg 
cttccagagg 
agagccgccc 
ttgagaagct 
aaaaaaaaaa 



gatcctacgt 
tgcgggccat 
cccaggccat 
cggtcttcgt 
tggtcatgtg 
ccaagtacat 
atgaagaagt 
tggcgctcgt 
agtgtgacaa 
tgacaccaca 
ctgtgtgcag 
tcacgctgca 
agctctttga 
tgcagcgctg 
tccaccgggg 
agcggcggat 
acatcgtctt 
gggcgggtcg 
ccaaaatcct 
aaatcgacat 
acccggagcc 
agagattgta 
aaaaaaaaaa 



ttccatccac 
cgtggactgt 
cctgggcatg 
gctggccacc 
ccacacgagg 
gcccagcgtc 
gttgaagaag 
gcggaatagg 
gatgctggag 
cgagaagcag 
gaagttcatg 
cggcctgcag 
tctcttggat 
catggccctg 
catggcccag 
cctggtggcc 
taactacgac 
ctttggcacc 
caatgacgtc 
ctccacatac 
gcccgcatgc 
tgagaataaa 
aaaa 1534 



agctctggct 
ggotttgagc 
gacgtcctgt 
ctacagcaga 
gag-ctggcct 
aaggtgtctg 
aactgtcccc 
agcttcagcc 
cagctggaca 
tgcatgatgt 
caggatccca 
cagtactacg 
gtg-ctggagt 
gcccagctcc 
gaggagcgcc 
accaatctgt 
atgcctgagg 
aaaggcctag 
caggaccggt 
atcgagcaga 
agcttcacct 
cttgttatta 



tccgggactt 
atccttctga 
gccaggccaa 
ttgagcctgt 
tccagatcag 
tgttcttcgg 
atgtcgtggt 
taaagaatgt 
tgcggcggga 
tcagcgccac 
tggaggtgtt 
tcaaactcaa 
ttaaccaggt 
tcgtggagca 
tgtcacgcta 
ttggccgggg 
actcggacac 
ccatcacttt 
ttgaagttaa 
gccggtaacc 
cccctttcca 
tggaaaaaaa 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 



<210> 192 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_005804 



<400> 192 

gttgagaagc tagagattgt atgagaataa agtgttatta tgaaatgaag aagcctcacc 



60 



<210> 193 

<211> 1416 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> Modif ied_base 

<222> 1 ... 1416 

<223> n = a,c,g, or t 

<300> 

<308> NM_005945 



<400> 193 

aggaattccg 

cgaacgccat 

tcccctgtac 

ggcgttcaag 

gtctgtcctc 

ttaccacaac 

gatttcjcgcg 

actgctattg 

tccgag-ttct 

aggtacatct 

gtggtg-tcta 



gaattccgga 
tctgggggtg 
cgccacatcg 
tgtcatcatc 
ccagtcggtg 
ctgaagaatg 
ggtttgctcc 
gaaagcctgg 
tcaggtcagg 
cgcctgacca 
tcgaagatcc 



attccgatgg 
tcccttgccg 
cgtacttggc 
aatggcggtt 
cagcaaactc 
tcatcaagga 
caacatcctg 
cctacactgt 
gaactatgac 
gctggctgac 
ctttgaccag 



atggaacaga 
tctgcaaagc 
tggcaacttc 
ctcatgctgg 
agggaagcca 
gaaatatggg 
gagaataaag 
aaaggtggtc 
ctggacttca 
ctgtacaagt 
gatgactggg 



aaataaatct 
tgg-tgccgtt 
'gaagtcatcc 
caacaagctg 
tgccgcattg 
aaagatgcca 
aaggcctgga 
atg-gcatgga 
agtctcccga 
ccttcatcaa 
gag;cttcaga 



aagtttggtg 
gagaaggggg 
tgccagtccc 
gccatgcaga 
gagcagaggt 
ccaatgtggg 
gctgctgaag 
cgtagcggcc 
tgaccccagc 
ggactaccca 
agttcacagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 
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cagtgcagga 
caaggcgtga 
gtgaccgagt 
tctcatcgtt 
actggggcag 
ctcctcagaa 
aaccccttgg 
ccctcccctg 
tgctagttag 
aagctccctg 
attcjtttttc 
tcccjgggtgg 
gacccatgaa 



atccaggtag tggggggatg actcacagtg accaacccaa agaggatcgc 72 0 

acgagaagtc ctgcaactgc ctcctgctca aagtcaacca gattggctcc 780 

ctcttcaggc gtgcaagctg gcccaggcca atcjgttgggg cgtcatggtg 840 

cgggggagac tgaagatacc ttcatcgctg acctggttgt ggggctgtgc 900 

atcaagactg gtgccccttg ccgatcacgc gcttggccaa gtacaaccag 960 

ttgaagagga gctgggcagc aaggctaagt ttgccggcag gaacttcaga 102 0 

ccaagtaagc tgtgggcagg caagccttcg gtcacctgtt ggctacagac 1080 

gtgtcagctc aggcagctcg aggcccccga ccaacacttg caggggtccc 1140 

cgcccaccgc cgtggagttc gtaccgcttc ctfcagaactc tacagaagcc 1200 

gaagccctgt tggcagctct agctttgcag ttgjtgtaatt ggcccaagtc 1260 

tcgccttact ttccaccaag tgtctagagt catgtgagcc tngtgtcatc 1320 

ccacaggcta gatccccggt ggttttgtgc tcaaaataaa aagcctcagt 13 80 
aaaaaaaaag gaattccgga attccg 1416 



<210> 194 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<30O> 

<308> NM_005945 



<40O> 194 

ttgtgtaatt ggcccaagtc attgtttttc tcgccttact ttccaccaag tgtctagagt 60 



<210> 195 

<2in_> 961 

<212> DNA 

<2 13 > Homo sapiens 

<30O> 

<308> NM_006014 



<40O> 195 

ggcgaccacg gtgtcttcaa aagccccgtc 
tggcjtcagtt tgcaccagcg ctctggaatc 
aggaaaccgc agcctctcaa ccgctgaccg 
ggccjggaact gaccacgcgc cagtcaggct 
ggacjtcgtgc gcagggggcg gggcttcggg 
cctcjcgcttc gggggtggag tcggagcggc 
gacgjcaggcg gaggcgctga cggcggggat 
gtggacacag ccgcagctcc ggccggtgga 
agagjacgccg cgtctgcggc cagggggtca 
gtgcctttcc cgaccccctt ggaggcggaa 
gagccccacc aaagggtggt tgggaaggat 
cgctggaaag ctgaagactg tcgcctgctc 
ctttccctgg tggtgcggac catgcagcgc 
cctgggcaaa tggagcgagg tcccactttg 
ccctagggca ggaattccca cagttgctac 
ttcttaaatg tttgttacta cagaaaataa 
a 961 



agggttggct tcctggggcc ggaccgactg 60 

gagttacgcg cgaaagggca gagtttctgg 12 0 

ggtctcagaa ggcccccggc agggccgctt 180 

ctccagggac ctcjcgcaggc gcgtgtgggc 240 

aaggagccac aga_gagggcg gggcgtagga 3 00 

gcggcggcgg tcatgcggga cgcggatgca 3 60 

ggccggggtg gccacagctg ccgcgggggc 42 0 

gctcccccag cgcacgcgcc aggtccgggc 480 

cgaatgcggc cgcacatatt caccctcagc 540 

atcgcccatg ggtccctggc accagatgcc 600 

ctcacagtga gtcjgcaggat cctggtcgtc 660 

cgaatttccg tcatcaactt tcttgaccag 72 0 

tttgggcccc cccjtttcccg ctaagcctgg 7 80 

cgtctccttg tacjgcagtgc gtccatcctt 840 

tttcctggga ggcjcctcatg ttttatctgg 900 

aactgcgcta ctaaaaaaaa aaaaaaaaaa 960 



<210> 196 

<213_> 60 

<212> DNA 

<213> Homo sapiens 

<30O> 

<308> NM_006014 



<40O> 196 
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ggcctcatgt tttatctggt tcttaaatgt ttgttactac agaaaataaa actgaggtat 60 

<210> 197 

<211> 1648 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006086 
<400> 197 

atgcgggaga tcgtgcacat ccaggccggc cagtgcggca accagatcgg ggccaagttc 60 

tgggaagtca tcagtgatga gcatggcatc gaccccagcg gcaactacgt gggcgactcg 12 0 

gacttgcagc tggagcggat cagcgtctac tacaacgagg cctcttctca caagtacgtg 180 

cctcgagcca ttctggtgga cctggaaccc ggaaccatgg acagtgtccg ctcaggggcc 240 

tttggacatc tcttcaggcc tgacaatttc atctttggtc aga^tggggc cggcaacaac 3 00 

tgggcoaagg gtcactacac ggagggggcg gagctggtgg attcggtcct ggatgtggtg 360 

cggaaggagt gtgaaaactg cgactgcctg cagggcttcc agctgaccca ctcgctgggg 42 0 

ggggg"Q*acgg gctccggcat gggcacgttg ctcatcagca aggtgcgtga ggagtatccc 480 

gaccgcatca tgaacacctt cagcgtcgtg ccctcaccca agg-tgtcaga cacggtggtg 540 

gaaccctaca acgccacgct gtccatccac cagctggtgg aaaacacgga tgaaacctac 600 

tgcatcgaca acgaggcgct ctacgacatc tgcttccgca ccctcaagct ggccacgccc 660 

acctacgggg acctcaacca cctggtatcg gccaccatga gcggagtcac cacctccttg 72 0 

cgcttcccgg gccagctcaa cgctgacctg cgcaagctgg ccgtcaacat ggtgcccttc 780 

ccgcgcctgc acttcttcat gcccggcttc gcccccctca ccaggcgggg cagccagcag 840 

taccgggccc tgaccgtgcc cgagctcacc cagcagatgt tcgatgccaa gaacatgatg 900 

gccgcctgcg acccgcgcca cggccgctac ctgacggtgg ccaccgtgtt ccggggccgc 9 60 

atgtccatga aggaggtgga cgagcagatg ctggccatcc agagcaagaa cagcagctac 102 0 

ttcgtg-gagt ggatccccaa caacgtgaag gtggccgtgt gtg-acatccc gccccgcggc 1080 

ctcaagatgt cctccacctt catcgggaac agcacggcca tccaggagct gttcaagcgc 1140 

atctccgagc agttcacggc catgttccgg cgcaaggcct tcctgcactg gtacacgggc 12 00 

gagggcatgg acgagatgga gttcaccgag gccgagagca acatgaacga cctggtgtcc 12 60 

gagtaccagc agtaccagga cgccacggcc gaggaagagg gcg-agatgta cgaagacgac 132 0 

gaggaggagt cggaggccca gggccccaag tgaaactgct cgcagctgga gtgagaggca 13 80 

ggtggcggcc ggggccgaag ccagcagtgt ctaaaccccc gga_gccatct tgctgccgac 1440 

accctgcttt ccccatcgcc ctagggctcc cttgccgccc tcctgcagta tttatggcct 1500 

cgtcctcccc cacctaggcc acgtgtgagc tgctcctgtc tctgtcttat tgcagctcca 1560 

ggcctgacgt tttacggttt tgttttttac tggtttgtgt ttatattttc ggggatactt 162 0 
aataaatcta ttgctgtcag ataccctt 1648 

<210> 198 

<211>- 60 

<212> DNA 

<213> Homo sapiens 

<300^ 

<308>- NM_006086 
<400^ 198 

tttttactgg tttgtgttta tattttcggg gatacttaat aaatctattg ctgtcagata 60 

<210> 199 

<211>- 3074 

<212>- DNA 

<213^ Homo sapiens 

<300^ 

<308=- NM_006096 
<400>~ 199 

aacaaacctc gcctggctcc cagctggtgc tgaagctcgt cagttcacca tccgccctcg 60 



89 



WO 2005/086891 



PCT/US2005/007894 



gcttccgcgg ggcgctgggc cgccagcctc ggcaccgtcc tttcctttct ccctcgcgtt 

aggcaggtga cagcagggac atgtctcggg agatgcagga tgtagacctc gctgaggtga 

agcctttggt ggagaaaggg gagaccatca ccggcctcct gcaagagttt gatgtccagg 

agcaggacat cgagacttta catggctctg ttcacgtcac gotgtgtggg actcccaagg 



gaaaccggcc 
accccctctt 
tggacgcccc 
ccatggatca 
ttattggcat 
ctgagatggt 
actgggccgc 
acctttttgg 
acattgtgaa 
ggcgcgacct 
ctgctctgtt 
caaaattgga 
agatctccca 
acatgccctc 
tcacttctct 
cccacaccag 
tcacccccaa 
gctaggcggc 
cccggcccct 
ttattttgta 
ccaaaggcgg 
ttccattaac 
aaaaacaaga 
cctggttcaa 
ctgaggagaa 
taatagtgac 
attcatgtcg 
cgatttttaa 
ttctagcatc 
aaaaatattt 
cctctcctcc 
cagggagaag 
gctgccctgg 
tggaattggg 
tgtgatttct 
ttttcaatct 
gtcacgttct 
tggggagcgg 
ttccctctcc 
ctgacgtttc 
cattttggct 
aagtcagcca 
cagccttcca 
ccttccctgg 
ggaaactatt 
tccagaccaa 
aaaaaaaaaa 



tgtcatcctc 
caactacgag 
tggccagcag 
gctggctgaa 
gggaacagga 
ggagggcctt 
ctccaagatc 
gaaggaagaa 
tgacatgaac 
ggagattgag 
ggtggttggg 
cccaacaaag 
gccggccaag 
ggctagcatg 
ggatggcacc 
cgagggcacc 
ctcgggtgct 
ctgcccagct 
tttcgccccc 
agagtgagct 
ggtggcggtg 
cagtggccgg 
cgcgtagcag 
acgggtgcct 
gctgatccag 
atgcaggcac 
aggctagagg 
agttggtgca 
ctcttaatgt 
cagcaaacgt 
acagccccaa 
atcatttaga 
ctgtgatgag 
gttactcgat 
gtggatccca 
tcgacagctg 
tggatcctca 
tggctgagaa 
tgcttcctct 
tgggcggcca 
caccgtggat 
ctgggacccg 
ggtggcgtga 
ccttgagtag 
gtgcacaagt 
taaatttgta 
aaaa 3 074 



acctaccatg 
gacatgcagg 
gacggcgcag 
atgcttcctg 
gcaggcgcct 
gtccttatca 
tcaggatgga 
atgcagagta 
cccggcaacc 
cgaccaatgc 
gacagctcgc 
accactctcc 
ctcgctgagg 
acccgcctga 
cgcagccgct 
cgcagccgct 
gctgggaaca 
gccgcccccg 
tgcctgccat 
ctggtggaga 
gaccaaagga 
ttgccactct 
cacacacttc 
ggtcagaagg 
tttccggaaa 
ctcttttaaa 
catttggaac 
tctagaaagc 
gcagcaaaag 
tgggcatcat 
cctcccaccc 
tttgttttgc 
tgtccttgca 
gtaagggatt 
gcttggttcc 
ggctggaacg 
gaactctttg 
aatgtaagga 
tttcctgctc 
gtgcggctgc 
tttctcatag 
aggatttctg 
gaggcaatga 
aaaagtcggg 
ctttccagag 
actttgcagc 



acatcggcat 
agatcaccca 
cctccttccc 
gagtccttca 
acatcctaac 
acgtgaaccc 
cccaagctct 
acgtggaagt 
tgcacctgtt 
cgggaaccca 
ctgcagtgga 
tcaagatggc 
ccttcaagta 
tgcggtcccg 
cccacaccag 
cgcacaccag 
gcgccgggcc 
gactctgatc 
actgcgccta 
caaatgaggt 
aggaagcaag 
cctcccctcc 
acaaagccaa 
ccagccgccc 
caaaatcctt 
caggcaaaac 
aacaaatcta 
tttgaatgca 
caggcgacaa 
ggtttttgaa 
ctgatacatg 
attccttaga 
ggggccggag 
ccttgttgtt 
aggaattttg 
tgaactcagt 
ctcttgtcgg 
ttctggaata 
cctaaccttt 
caggttcctg 
gaagtttggt 
ggaccccgca 
ctcgttacct 
gatcggggca 
gagtttctta 
ggaaaaaaaa 



gaaccacaaa 
gcactttgcc 
ccgcagggtac 
acagtttggg 
tcgatttgct 
ttgtgcggaa 
gccggacatg 
ggtccacacc 
catcaatgcc 
cacagtcacc 
t^ccgtggtg 
gcg-actgtggc 
cttcgtgcag 
cacagcctct 
cgagggcacc 
ccgagggggcc 
caagtccatg 
t ctgtagtgg 
actcggtatt 
ctattacgtg 
catctccgca 
ctcagagaca 
goctaggccg 
acttcccgtt 
ttctcatttg 
a^gaaggggg 
cgtagttaac 
gaagcaaaca 
aatctcctgg 
gcg-ctttagtt 
agccagtgat 
atggagggca 
taggagcact 
gtgttgagat 
tgtgattggc 
agctgaacct 
g-^tgggggtg 
catattccat 
cgccgaatgg 
tactactgcc 
c agagtgaat 
g-ttgggagga 
gccgcccatc 
agagaggctg 
atgagatatt 
aaaaaaaaaa 



acctgctaca 
gtctgccacg 
atgtacccct 
ctgaaaagca 
ctaaacaacc 
ggctggatgg 
gtggtgtccc 
taccgccagc 
tacaacagcc 
ctgcagtgcc 
gagtgcaact 
ggcctcccgc 
ggcatgggat 
ggttccagcg 
cgaagccgct 
cacctggaca 
gaggtctcct 
ccccctcctc 
aatccaaagc 
ggtgccctct 
tcgcatcctc 
ccaaactgcc 
ccctgagcat 
tcctctttaa 
gggagggggg 
aaaaggtggg 
ttgaagaaac 
agcttgattt 
ctttacagac 
ctgctttctg 
tattcttgtt 
acattccaca 
ggggtggggg 
ccagtgcagt 
ttaaatccag 
gtctgacccg 
ggaactcacg 
gggactttcc 
ggcagcacca 
ttgtactttt 
tgaatattgt 
ggaagtagtc 
accttggagg 
agtacggatg 
tgtatttatt 
aaaaaaaaaa 



120 
180 
240 
300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 



<210> 200 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006096 
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<400> 200 

gagtacggat gggaaactat tgtgcacaag tctttccaga gg&gtttctt aatgagatat 6 0 

<210> 201 

<211> 2148 

<212> DNA 

<213> Homo sapiens 

<30O> 

<308> NM_006115 
<400> 201 

gcttcsigggt acagctcccc cgcagccaga agccgggcct gcagcgcctc agcaccgctc 6 0 

cgggacaccc cacccgcttc ccaggcgtga cctgtcaaca gcaacttcgc ggtgtggtga 12 0 

actctctgag gaaaaaccat tttgattatt actctcagac gtQcgtggca acaagtgact 180 

gagacctaga aatccaagcg ttggaggtcc tgaggccagc ctaagtcgct tcaaaatgga 2 40 

acgaag-gcgt ttgtggggtt ccattcagag ccgatacatc agcatgagtg tgtggacaag 3 00 

cccacggaga cttgtggagc tggcagggca gagcctgctg aaggatgagg ccctggccat 3 60 

tgccgccctg gagttgctgc ccagggagct cttcccgcca ctcttcatgg cagcctttga 42 0 

cgggagacac agccagaccc tgaaggcaat ggtgcaggcc tggcccttca cctgcctccc 480 

tctggg-agtg ctgatgaagg gacaacatct tcacctggag accttcaaag ctgtgcttga 540 

tggacttgat gtgctccttg cccaggaggt tcgccccagg aggtggaaac ttcaagtgct 600 

ggatttacgg aagaactctc atcaggactt ctggactgta tgg-tctggaa acagggccag 6 60 

tctgtactca tttccagagc cagaagcagc tcagcccatg acaaagaagc gaaaagtaga 720 

tggtttgagc acagaggcag agcagccctt cattccagta gag-gtgctcg tagacctgtt 7 80 

cctcaaggaa ggtgcctgtg atgaattgtt ctcctacctc attgagaaag tgaagcgaaa 840 

gaaaaatgta ctacgcctgt gctgtaagaa gctgaagatt tttgcaatgc ccatgcagga 9 00 

tatca<agatg atcctgaaaa tggtgcagct ggactctatt gaagatttgg aagtgacttg 960 

tacctg-gaag ctacccacct tggcgaaatt ttctccttac ctgggccaga tgattaatct 102 0 

gcgtagactc ctcctctccc acatccatgc atcttcctac atttccccgg agaaggaaga 1080 

gcagtatatc gcccagttca cctctcagtt cctcagtctg cagtgcctgc aggctctcta 1140 

tgtggactct ttatttttcc ttagaggccg cctggatcag ttgctcaggc acgtgatgaa 12 00 

ccccttggaa accctctcaa taactaactg ccggctttcg gaaggggatg tgatgcatct 12 60 

gtccceigagt cccagcgtca gtcagctaag tgtcctgagt ctcaagtgggg tcatgctgac 132 0 

cgatgtaagt cccgagcccc tccaagctct gctggagaga gcctctgcca ccctccagga 13 80 

cctgg-tcttt gatgagtgtg ggatcacgga tgatcagctc cttgccctcc tgccttccct 1440 

gagccactgc tcccagctta caaccttaag cttctacggg aattccatct ccatatctgc 1500 

cttgcagagt ctcctgcagc acctcatcgg gctgagcaat ctgacccacg tgctgtatcc 156 0 

tgtccccctg gagagttatg aggacatcca tggtaccctc cacctggaga ggcttgccta 162 0 

tctgcatgcc aggctcaggg agttgctgtg tgagttgggg cggcccagca tggtctggct 1680 

tagtgccaac ccctgtcctc actgtgggga cagaaccttc tatgacccgg agcccatcct 1740 

gtgcccctgt ttcatgccta actagctggg tgcacatatc aaatgcttca ttctgcatac 1800 

ttggacacta aagccaggat gtgcatgcat cttgaagcaa caaagcagcc acagtttcag 186 0 

acaaatgttc agtgtgagtg aggaaaacat gttcagtgag gaaaaaacat tcagacaaat 192 0 

gttca^tgag gaaaaaaagg ggaagttggg gataggcaga tgttgacttg aggagttaat 1980 

gtgatctttg gggagataca tcttatagag ttagaaatag aatctgaatt tctaaaggga 2 040 

gattc-bggct tgggaagtac atgtaggagt taatccctgt gtagactgtt gtaaagaaac 2100 
tgttg-aaaat aaagagaagc aatgtgaagc aaaaaaaaaa aaaaaaaa 2148 

<210^ 202 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300=- 

<308> NM_006115 
<400>- 202 

tgggg-cigata catcttatag agttagaaat agaatctgaa tttctaaagg gagattctgg 60 
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<210> 203 

<211> 1051 

<212> DNA 

<213> Homo sapiens 

<30O> 

<30 8> NM_006332 



<40O> 203 

ggaccgccgc 

tgcccctgct 

gctggacgtc 

gaatgggcca 

gtccaatgca 

agccttcctg 

cacgctggtg 

gtgccagcat 

acttgacatg 

gagaagtctg 

ggagtgtgca 

tgctctccag 

agatcagacc 

ctgcccttcc 

cggagagctc 

cctcggcacc 

caaaattcca 

agtttaataa 



ctggttaaag 
gcgatgaccc 
cccacggcgg 
ccagttaact 
ccgcttgtca 
atccgggagc 
ccctacggaa 
ggagaagagg 
gagctagcct 
ccactatgcc 
atgggggacc 
ccaccacacg 
cagctcctta 
tcaaccagct 
atggaaggcg 
tgctacttac 
ccccatgatc 
gcaaaaaaaa 



gcgcttattt 
tgtcgccact 
cggtgcaggc 
acaagacagg 
atgtgaccct 
tcttcccaac 
acgcacagga 
agtgcaaatt 
tcctgaccat 
tgcagctcta 
gcggcatgca 
agtatgtgcc 
cccttgtctg 
ccctcaggag 
agtgggaacc 
caactggaaa 
aagaatcctg 
aaaaaaaaaa 



cccaggcagc 
tctgctgttc 
gtcccctctg 
caatctatac 
ctactatgaa 
atggctgttg 
acaaaatgtc 
caacaaggtg 
tgtctgcatg 
cgccccaggg 
gctcatgcac 
ctgggtcacc 
ccagttgtac 
tgtttgcttc 
cggctgcctg 
attttatgca 
ctccactaag 
a 1051 



cgctgcagtc 
ctgccaccgc 
caagcgttag 
ctgcg-ggggc 
gcactgtgcg 
gtcatggaga 
agtgg-caggt 
gaggcctgcg 
gaaga-gtttg 
ctgtcgccag 
gccaacgccc 
gtcaaLtggga 
cagggcaaga 
aagtcjatggc 
ccttfcttttc 
tcccatgaag 
aatgg-tgcta 



gccacacctt 
tgctgctgct 
acttctttgg 
ccctgaagaa 
gtggctgccg 
tcctcaatgt 
gggagttcaa 
tgttggatga 
aggacatgga 
acactatcat 
agcggacaga 
aacccttgga 
agccggatgt 
cggtgagctg 
tgatccagac 
cccagataca 
aagtaaaact 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 



<210> 204 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<30 8> NM_006332 
<400> 204 

aaattccacc cctagatcaa gaatcctgct ccactaagaa tggtg-ctaaa gtaaaactag 60. 



<210> 205 



<211> 1714 

<212> DNA 

<213> Homo sapiens 

<300> 

<30 8> NM_006417 



<400> 205 

ggg~<jcatttt gtgcctgcct agctatccag acagagcagc taccctcagc tctagctgat 60 

actacagaca gtacaacaga tcaagaagta tggcagtgac aactcgtttg acacggttgc 12 0 

acgaaaagat cctgcaaaat cattttggag ggaagcggct tagccttctc tataagggta 180 

gtg-tccatgg attccgtaat ggagttttgc ttgacagatg ttgtaatcaa gggcctactc 240 

taacagtgat ttatagtgaa gatcatatta ttggagcata tgcgcjaagag agttaccagg 3 00 

aaggaaagta tgcttccatc atcctttttg cacttcaaga tactaaaatt tcagaatgga 3 60 

aactaggact atgtacacca gaaacactgt tttgttgtga tgttacaaaa tataactccc 42 0 

caactaattt ccagatagat ggaagaaata gaaaagtgat tatggactta aagacaatgg 480 

aaaatcttgg acttgctcaa aattgtacta tctctattca ggattatgaa gtttttcgat 540 

gcgaagattc actggatgaa agaaagataa aaggggtcat tgagctcagg aagagcttac 600 

tgtctgcctt gagaacttat gaaccatatg gatccctggt tcaacaaata cgaattctgc 660 

tgctgggtcc aattggagct gggaagtcca gctttttcaa ctcag-tgagg tctgttttcc 72 0 
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etagggcatgt 
ataggacata 
actcactggg 
tgaacggtaa 
atcatgacta 
ttgatgccag 
cfa.agggagtt 
tg-gatttgat 
agctagagga 
attattcctc 
gacgaatgct 
atctaaggga 
cgtaaatttc 
agaagtatct 
aacattgtag 
agtaataatt 
ctcggaaaaa 



aacgcatcag 
ctctattaga 
gctgagtgag 
cattcgtgat 
cattgattcc 
ctctattcaa 
ggtaaacgct 
tacaaaaggt 
agtccaaaga 
tgagtgggag 
atgggctgca 
ggaaattatc 
ctcacatcac 
aagaccaaag 
tacttgtaaa 
tttcttggat 
aaaaaaaaaa 



gctttggtgg 
gacgggaaag 
aaagaaggcg 
agataccagt 
ccatcgctga 
tacttctcct 
ggtgtggtac 
gaccttatag 
aaacttggat 
ctggaccctg 
gatgacttct 
aactgtgcac 
agaagattaa 
ggatgtgttt 
taactagaaa 
ttatgttctg 
aaaaaaaaaa 



gcactaatac 
atggcaaata 
gcctgtgcag 
ttaatcccat 
aggacagaat 
ctcagatgat 
atgtggcttt 
aaatagagag 
ttgctctttc 
taaaggatg-t 
tagaggattt 
aaggaaaaaa 
aattcagaaa 
tattaatgtc 
taacatgatt 
tatctgtga_a 
aaaa 173_4 



aactgggata 
cctgccgttt 
ggatgacata 
ggaatcaatc 
tcattgtgtg 
agtaaagatc 
gctcactcat 
atgtgagcct 
tgacatctcg 
tctaattctt 
gccttttgag 
atagatatgt 
ggagaaaaca 
taggatgaag 
tagtcataat 
aaaataaatt 



tctgagaagt 
attctgtgtg 
ttctatatct 
aaattaaatc 
gcatttgtat 
aaaagaattc 
gtggatagca 
gtgaggtcca 
gtggttagca 
tctgctctga 
caaataggga 
gaaaggttca 
cagaccaaag 
aaatgcatag 
tgtgaaaaat 
tcttataaaa 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 



<210> 206 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006417 
<400> 206 

atgacatatt ctatatcttg aacggtaaca ttcgtgata_g ataccagttt aatcccatgg 

<210> 207 

<211> 3791 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006461 



60 



<400> 207 

acagacggcg 

tggttgaaga 

ggaaaaccat 

acctctggaa 

c tgcaggaag 

ttatcttcag 

gatgagcagc 

gcagtagacc 

gggcagcaac 

atatctttaa 

atgggagaca 

ccgtcccatc 

aaggaaagcc 

aacgccttct 

ttccgtgtca 

agagaaatga 

tcaagtgtgg 

caagaagctc 

tcttggatgt 

aatctccgcc 

cctttctcta 

acaaacacat 

cfccctgtgtg 



ggtgaacatg 
tgtggcgagt 
ctatgagaac 
aaagatcccc 
gcagcaacaa 
aacatttcag 
ctctagatcc 
cactgggcaa 
aagacatgat 
atggaccttt 
ggttttcaga 
tcttagagga 
tgagcagtag 
tgccttcctc 
atcatgtgga 
ggtttcccac 
aagatattct 
caggcccagc 
ccccactggc 
aaagcttatc 
cttgctcggt 
cccagacagg 
gccggcctcc 



gcgtcctcga 
gaaaaaactg 
tcctctccgt 
cgcttgctcc 
ctcgtctcca 
tcattcctca 
aattccccaa 
ttatatggtt 
atttgaggcc 
gagaacagac 
agttgctgct 
gtctccacca 
aactgaggct 
tgttctctgg 
cccagaggag 
acatcctaag 
gtccacatgc 
agtagaagat 
ctggctggaa 
ccttccctcg 
ggggacttgg 
cctggttggc 
agatctgact 



cttggtctg-a. 
agcctcagc c 
gaacttacc c 
tcgctgaccc 
gtggatttt^ 
aagtggcta^- 
attagctctsi 
aaaaccatcg 
cgtttagat«a. 
gatctggtg-«a. 
gtatctgag^a. 
aatccctgt-fc 
gtgcgtgag^ 
ctttcccct-t 
gaaattgtacj 
gagtctgaas. 
ctgacaccaa 
gttggtaggs. 
aaaggtgtas. 
atgcttcggc? 
tttactcct-t: 
accaagcaca 
gccttgtctc 



gacgtgatag 
tgtcgccttc 
tgcagcccgg 
catcactgtg 
taaataacaa 
aaacttgtca 
ctcctaaaac 
tccttgtacc 
ccatggcaga 
gagaggaggt 
aacctatctt 
ctgaacaact 
acttagtacc 
caactgcctt 
agcatggagc 
cagaagatca 
atctagtaga 
ttcttggctc 
atacctccgt 
atgctgcaat 
cagcaccaca 
gtacttctga 
gacatgactt 



gcctgccttc 
gccccagacg 
tgccctcacc 
caagctgggg 
gaggacagac 
gcatgaatca 
gtctgaggaa 
atctccactg 
gacaaacagc 
ggcaccctgc 
tcaggaatct 
acattgctcc 
ttctgaaagt 
ggcagcagat 
tatggaggaa 
agcacttgtc 
aatggaatcc 
tgatacagag 
catgctggaa 
tggcactacc 
ggaaaagagt 
gacagagcag 
ggaagataac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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ctgctgagct ctcttgtcat tgtggagttt ctctcccgcc ag-cttcggga ctggaagagc 1440 

cagctggctg tccctcaccc agaaacccag gacagtagca cacagactga cacatctcac 1500 

agtgggataa ctaataaact tcagcatctt aaggagagcc atgagatggg acaggcccta 1560 

cagcaggcca gaaatgtcat gcaatcatgg gtgcttatct ctaaagagct gatatccttg 162 0 

cttcacctat ccctgttgca tttagaagaa gataagacta ctgtgaatca ggagtctcgg 1680 

cgtgcagaaa cattggtctg ttgctgtttt gatttgctga agaaattgag ggcaaagctc 1740 

cagagcctca aagcagaaag ggaggaggca aggcacagag aggaaatggc tctcagaggc 1800 

aaggatgcgg cagagatagt gttggaggct ttctgtgcac acgccagcca gcgcatcagc 1860 

cagctggaac aggacctagc atccatgcgg gaattcagag gccttctgaa ggatgcccag 192 0 

acccaactgg tagggcttca tgccaagcaa gaagagctgg ttcagcagac agtgagtctt 1980 

acttctacct tgcaacaaga ctggaggtcc atgcaactgg at tatacaac atggacagct 2 040 

ttgctgagtc ggtcccgaca actcacagag aaactcacag tcaagagcca gcaagccctg 2100 

caggaacgtg atgtggcaat tgaggaaaag caggaggttt ctagggtgct ggaacaagtc 2160 

tctgcccagt tagaggagtg caaaggccaa acagaacaac tg-gagttgga aaacattcgt 2220 

ctagcaacag atctccgggc tcagttgcag attctggcca acatggacag ccagctaaaa 2280 

gagctacaga gtcagcatac ccattgtgcc caggacctgg ctatgaagga tgagttactc 2340 

tgccagctta cccagagcaa tgaggagcag gctgctcaat gcgtaaagga agagatggca 2400 

ctaaaacaca tgcaggcaga actgcagcag caacaagctg tcctggccaa agaggtgcgg 2460 

gacctgaaag agaccttgga gtttgcagac caggagaatc aggttgctca cctggagctg 2520 

ggtcciggttg agtgtcaatt gaaaaccaca ctggaagtgc tccgggagcg cagcttgcag 2580 

tgtgagaacc tcaaggacac tgtagagaac ctaacggcta aactggccag caccatagca 2640 

gataaccagg agcaagatct ggagaaaaca cggcagtact ctcaaaagct agggctgctg 2700 

actgagcaac tacagagcct gactctcttt ctacagacaa aactaaagga gaagactgaa 2760 

caagagaccc ttctgctgag tacagcctgt cctcccaccc aggaacaccc tctgcctaat 2 82 0 

gacaggacct tcctgggaag catcttgaca gcagtggcag atgaagagcc agaatcaact 2 880 

cctgtgccct tgcttggaag tgacaagagt gctttcaccc gagtagcatc aatggtttcc 2940 

cttcagcccg cagagacccc aggcatggag gagagcctgg cagaaatgag tattatgact 3 000 

actgagcttc agagtctttg ttccctgcta caagagtcta aagaagaagc catcaggact 3 060 

ctgcagcgaa aaatttgtga gctgcaagct aggctgcagg cccaggaaga acagcatcag 312 0 

gaagtccaga aggcaaaaga agcagacata gagaagctga accaggcctt gtgcttgcgc 3180 

tacaagaatg aaaaggagct ccaggaagtg atacagcaga atgagaagat cctagaacag 3240 

atagacaaga gtggcgagct cataagcctt agagaggagg tgacccacct tacccgctca 33 00 

cttc^gcgtg cggagacaga gaccaaagtg ctccaggagg cc"tggcaggc cagctggact 33 60 

ccaactgcca gcctatggcc accaattgga tccaggagaa ag-tgtggctc tctcaggagg 342 0 

tggacaaact gagagtgatg ttcctggaga tgaaaaatga gaaggaaaac tcctgatcaa 3480 

gttccagagc ccatagaaat atcctagagg agaaccttcg gcgctctgac aaggagttag 3540 

aaaaactaga tgacattgtt cagcatattt ataagaccct gctctctatt ccagaggtgg 3600 

tgagg-ggatg caaagaacta cagggattgc tggaatttct gagctaagaa actgaaagcc 3 660 

agaatttgtt tcacctcttt ttacctgcaa taccccctta ccccaatacc aagaccaact 372 0 

ggcatagagc caactgagat aaatgctatt taaataaagt gtatttaatg aaaaaaaaaa 3780 
aaaaaaaaaa a 3 791 

<210> 208 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006461 
<400> 208 

ctgacaagga gttagaaaaa ctagatgaca ttgttcagca tatttataag accctgctct 60 

<210> 209 

<211=> 2856 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> ISIM_006516 
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<400> 209 

tagtcgcggg 

gtcagagtcg 

cgcacgcccg 

t ggagcccag 

ttggctccct 

aggagttcta 

tcaccacgct 

tctctgtggg 

tgctggcctt 

tgctgatcct 

ccatgtatgt 

agctgggcat 

gcaacaagga 

gcatcgtgct 

agaaccgggc 

tgcaggagat 

agctgttccg 

cccagcagct 

cg-ggggtgca 

ctgtcgtgtc 

tcgctggcat 

taccctggat 

tcjggtcctgg 

cagctgccat 

gcttccagta 

tcj-gttctgtt 

atgagatcgc 

agctgttcca 

gcctgctccc 

aacctgacag 

ccagaagaat 

aaatctattc 

atatcagcct 

geigggtggag 

ctggacctat 

gaggtggcta 

cattaggatt 

cctgagacca 

gccgggttct 

gggagcctgc 

t gcaagatat 

atatctggac 

tataaatggc 

tttggatggg 

gactcaggat 

tttgatccct 

atcacatatt 

aggcttgaaa 



tccccgagtg agcacgccag ggagcagg^.g accaaacgac gggggtcgga 60 

cagtgggagt ccccggaccg gagcacgagc ctgagcggga gagcgccgct 12 0 

tcgccacccg cgtacccggc gcagccagag ccaccagcgc agcgctgcca 180 

cagcaagaag ctgacgggtc gcctcatgct ggctgtggga ggagcagtgc 240 

gcagtttggc tacaacactg gagtcatcaa tgccccccag aaggtgatcg 3 00 

caaccagaca tgggtccacc gctatgggcja gagcatcctg cccaccacgc 360 

ctggtccctc tcagtggcca tcttttctcjt tgggggcatg attggctcct 420 

ccttttcgtt aaccgctttg gccggcggsa ttcaatgctg atgatgaacc 480 

cgtgtccgcc gtgctcatgg gcttctcg<aa actgggcaag tcctttgaga 540 

gggccgcttc atcatcggtg tgtactgccjg cctgaccaca ggcttcgtgc 600 

gggtgaagtg tcacccacag cctttcgtcjg ggccctgggc accctgcacc 660 

cgtcgtcggc atcctcatcg cccaggtg-fct cggcctggac tccatcatgg 72 0 

cctgtggccc ctgctgctga gcatcatctt catcccggcc ctgctgcagt 7 80 

gcccttctgc cccgagagtc cccgcttcct gctcatcaac cgcaacgagg 840 

caagagtgtg ctaaagaagc tgcgcggg«a.c agctgacgtg acccatgacc 900 

gaaggaagag agtcggcaga tgatgcggcja gaagaaggtc accatcctgg 960 

ctcccccgcc taccgccagc ccatcctcat cgctgtggtg ctgcagctgt 102 0 

gtctggcatc aacgctgtct tctattactc cacgagcatc ttcgagaagg 1080 

gcagcctgtg tatgccacca ttggctcccjg tatcgtcaac acggccttca 1140 

gctgtttgtg gtggagcgag caggccggcg gaccctgcac ctcataggcc 12 00 

ggcgggttgt gccatactca tgaccatcgc gctagcactg ctggagcagc 12 60 

gtcctatctg agcatcgtgg ccatctttc/g ctttgtggcc ttctttgaag 132 0 

ccccatccca tggttcatcg tggctgaact cttcagccag ggtccacgtc 1380 

tgccgttgca ggcttctcca actggacctc aaatttcatt gtgggcatgt 1440 

tgtggagcaa ctgtgtggtc cctacgtc-tt catcatcttc actgtgctcc 1500 

cttcatcttc acctacttca aagttcctcja gactaaaggc cggaccttcg 15 60 

ttccggcttc cggcaggggg gage cage ca aagtgataag acacccgagg 1620 

tcccctgggg gctgattccc aagtgtgacjt cgccccagat caccagcccg 1680 

agcagcccta aggatctctc aggagcacag gcagctggat gagacttcca 1740 

atgtcagccg agccgggcct ggggctcctt tctccagcca gcaatgatgt 1800 

attcaggact taacggctcc aggattttaa caaaagcaag actgttgctc 1860 

agacaagcaa caggttttat aatttttt*ta ttactgattt tgttattttt 192 0 

gagtctcctg tgcccacatc ccaggctt ca ccctgaatgg ttccatgcct 1980 

actaagccct gtcgagacac ttgccttctzt cacccagcta atctgtaggg 2 040 

gtcctaagga cacactaatc gaactatgaa ctacaaagct tctatcccag 2100 

tggccacccg ttctgctggc ctggatctcc ccactctagg ggtcaggctc 2160 

tgccccttcc catctcttcc tacccaac ca ctcaaattaa tctttcttta 2220 

gttgggagca ctggagtgca gggaggagaig gggaagggee agtctgggct 22 80 

agtctccttt gcactgaggg ccacactatt accatgagaa gagggectgt 2340 

aaactcactg ctcaagaaga catggagact cctgccctgt tgtgtataga 2400 

ttatatatat ttttggttgt caatattaaa tacagacact aagttatagt 2 460 

aagecaaett gtaaatacac cacctcac tc ctgttactta cctaaacaga 2520 

tggtttttag aaacatggtt ttgaaatget tgtggattga gggtaggagg 2 5 80 

agtgagacag aagtaagtgg ggttgcaacc actgeaaegg cttagacttc 2 640 

ccagtccctt acacgtacct ctcatcagtg tcctcttgct caaaaatctg 27 00 

gttacccaga gaatatatac attctttatc ttgacattca aggcatttct 2760 

tgatagttgg tgttcaaaaa aacactagtt ttgtgccagc cgtgatgctc 2 82 0 
tegcattatt ttgaatgtga agggaa 2 85 6 



<210> 210 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006516 



<400> 210 

aaacagatat aaatggctgg tttttagaaa catggttttg aaatgcttgt ggattgaggg 60 
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<210> 211 
<211> 576 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM__006607 
<400> 211 

atggctactc tgatctacgt tgataaggaa 
aaggatgtgc tgaagctgga gtctagacct 
gttttaacac cacgttttgg caaaacatac 
agaaaggctt tgggcactgt caacagagct 
agaaaacaaa aacagccaag cttttctgcc 
aaaagttctg ttcctgcctc agatgacgcc 
aatcttctag actttgagag ttttgacctg 
ttgagtggag tgcctctcat gatccttgat 
ctgggccccc cttcacctgt gaaaatgccc 
gtctccttca agcattctgt cgaccctgga 

<210> 212 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006607 



attggag-aac caggcacccg tgtggctgcc 60 

tcaatcaaag cattagatgg gatatctcaa 12 0 

gatgctccat cagccttacc taaagctacc 180 

acagaaaagt cagtaaagac caatggaccc 240 

aaaaagatga ccgagaagac tgttaaaaca 3 00 

tatccagaaa tagaaaaatt ctttcccttc 3 60 

cctgaagagc gccagattgc acacctcccc 42 0 

gaggagg-gag agcttgaaaa gctgtttcag 480 

tctccaccat gggaatgcaa tctgtttgca 540 
tgttga 57 6 



<400> 212 

cgcctatcca gaaatagaaa aattctttcc cttcaatctt ctagactttg agagttttga 60 

<210> 213 

<211> 2058 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006820 
<400> 213 

gcacgaggaa gccacagatc tcttaagaac tttctgtctc caaaccgtgg ctgctcgata 60 

aatcagacag aacagttaat cctcaattta agcctgatct aacccctaga aacagatata 12 0 

gaacaatgga agtgacaaca agattgacat ggaatgatga aaatcatctg cgcaactgct 180 

^ a ^ gtt tct ttgagtc ttctctataa gtctagtortt catggaggta gcattgaaga 240 

tatggttgaa agatgcagcc gtcagggatg tactataaca atggcttaca ttgattacaa 300 

tatgattgta gcctttatgc ttggaaatta tattaattta cgtgaaagtt ctacagagcc 360 

aaatgattcc ctatggtttt cacttcaaaa gaaaaatgac accactgaaa tagaaacttt 420 

^Jr^ff acagcaccaa aaattattga tgagcaactg gtgtgtcgtt tatcgaaaac 480 

ggatattttc attatatgtc gagataataa aatttatcta gataaaatga taacaagaaa 540 

cttgaaacta aggttttatg gccaccgtca gtatttgsraa tgtgaagttt ttcgagttga 600 

aggaattaag gataacctag acgacataaa gaggataa_tt aaagccagag agcacagaaa 660 

taggcttcta gcagacatca gagactatag gccctatgrca gacttggttt cagaaattcg 720 

tattcttttg gtgggtccag ttgggtctgg aaagtccagt tttttcaatt cagtcaagtc 780 

tatttttcat ggccatgtga ctggccaagc cgtagtgg-gg tctgatacca ccagcataac 840 

cgagcggtat aggatatatt ctgttaaaga tggaaaaaat ggaaaatctc tgccatttat 900 

gttgtgtgac actatggggc tagatggggc agaaggagca ggactgtgca tggatgacat 960 

tccccacatc ttaaaaggtt gtatgccaga cagatatcag tttaattccc gtaaaccaat 1020 

tacacctgag cattctactt ttatcacctc tccatctc-bg aaggacagga ttcactgtgt 1080 

ggcttatgtc ttagacatca actctattga caatctctac tctaaaatgt tggcaaaagt 1140 

craagcaagtt cacaaagaag tattaaactg tggtatagca tatgtggcct tgcttactaa 12 00 
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agtggatgat tgcagtgagg ttcttcaaga caacttttta aacatgagta gatctatgac 126 0 

ttctcaaagc cgggtcatga atgtccataa aatgctagg-c attcctattt ccaatatttt 1320 

gatggttgga aattatgctt cagatttgga actggacccc atgaaggata ttctcatcct 13 80 

ctctgcactg aggcagatgc tgcgggctgc agatgatttt ttagaagatt tgcctcttga 1440 

ggaaactggt gcaattgaga gagcgttaca gccctgcatt tgagataagt tgccttgatt 1500 

ctgacatttg gcccagcctg tactggtgtg ccgcaatgag agtcaatctc tattgacagc 15 60 

ctgcttcaga ttttgctttt gttcgttttg ccttctgtcc ttggaacagt catatctcaa 1620 

gttcaaaggc caaaacctga gaagcggtgg gctaagata_g gtcctactgc aaaccacccc 1680 

tccatatttc cgtaccattt acaattcagt ttctgtgaca tctttttaaa ccactggagg 1740 

aaaaatgaga tattctctaa tttattcttc tataacactc tatatagagc tatgtgagta 1800 

ctaatcacat tgaataatag ttataaaatt attgtataga catctgcttc ttaaacagat 1860 

tgtgagttct ttgagaaaca gcgtggattt tacttatctg tgtattcaca gagcttagca 192 0 

cagtgcctgg taatgagcaa gcatacttgc cattactttt ccttcccact ctctccaaca 1980 

tcacattcac tttaaatttt tctgtatata gaaaggaaaa ctagcctggg caacatgatg 2 040 
aaaccccatc tccactgc 2 05 8 

<210> 214 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006820 
<400> 214 

tgagttcttt gagaaacagc gtggatttta cttatctgtg tattcacaga gcttagcaca 60 

<210> 215 

<211> 2825 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006845 
<400> 215 

gcgaaattga ggtttcttgg tattgcgcgt ttctcttcct tgctgactct ccgaatggcc 60 

atggactcgt cgcttcaggc ccgcctgttt cccggtctcg ctatcaagat ccaacgcagt 12 0 

aatggtttaa ttcacagtgc caatgtaagg actgtgaact tggagaaatc ctgtgtttca 180 

gtggaatggg cagaaggagg tgccacaaag ggcaaagaga ttgattttga tgatgtggct 240 

gcaataaacc cagaactctt acagcttctt cccttacatc cgaaggacaa tctgcccttg 3 00 

■ caggaaaatg taacaatcca gaaacaaaaa cggagatccg tcaactccaa aattcctgct 3 60 

ccaaaagaaa gtcttcgaag ccgctccact cgcatgtcca ctgtctcaga gcttcgcatc 42 0 

acggctcagg agaatgacat ggaggtggag ctgcctgcag ctgcaaactc ccgcaagcag 480 

ttttcagttc ctcctgcccc cactaggcct tcctgccctg cagtggctga aataccattg 540 

aggatggtca gcgaggagat ggaagagcaa gtccattcca tccgtggcag ctcttctgca 600 

aaccctgtga actcagttcg gaggaaatca tgtcttgtg-a aggaagtgga aaaaatgaag 660 

aacaagcgag aagagaagaa ggcccagaac tctgaaatga gaatgaagag agctcaggag 72 0 

tatgacagta gttttccaaa ctgggaattt gcccgaatga ttaaagaatt tcgggctact 780 

ttggaatgtc atccacttac tatgactgat cctatcgaag agcacagaat atgtgtctgt 840 

gttaggaaac gcccactgaa taagcaagaa ttggccaaga aagaaattga tgtgatttcc 900 

attcctagca agtgtctcct cttggtacat gaacccaagt tgaaagtgga cttaacaaag 960 

tatctggaga accaagcatt ctgctttgac tttgcatttg atgaaacagc ttcgaatgaa 102 0 

gttgtctaca ggttcacagc aaggccactg gtacagacaa tctttgaagg tggaaaagca 1080 

acttgttttg catatggcca gacaggaagt ggcaagacac atactatggg cggagacctc 1140 

tctgggaaag cccagaatgc atccaaaggg atctatgcca tggcctcccg ggacgtcttc 12 00 

ctcctgaaga atcaaccctg ctaccggaag ttgggcctgg aagtctatgt gacattcttc 12 60 

gagatctaca atgggaagct gtttgacctg ctcaacaaga aggccaagct gcgcgtgctg 1320 

gaggacggca agcaacaggt gcaagtggtg gggctgca^gg agcatctggt taactctgct 13 80 

gatgatgtca tcaagatgct cgacatgggc agcgcctgca gaacctctgg gcagacattt 1440 

gccaactcca attcctcccg ctcccacgcg tgcttccaLaa ttattcttcg agctaaaggg 1500 
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agaatgcatg gcaagttctc tttggtagat ctggcaggga atgagcgagg cgcagacact 15 60 

tccagtgctg accggcagac ccgcatggag- ggcgcagaaa tcaacaagag tctcttagcc 162 0 

ctgaaggagt gcatcagggc cctgggacag- aacaaggctc acaccccgtt ccgtgagagc 16 80 

aagctgacac aggtgctgag ggactccttc attggggaga actctaggac ttgcatgatt 1740 

gccacgatct caccaggcat aagctcctgt gaatatactt taaacaccct gagatatgca 1800 

gacagggtca aggagctgag cccccacagt gggcccagtg gagagcagtt gattcaaatg 186 0 

gaaacagaag agatggaagc ctgctctaac ggggcgctga ttccaggcaa tttatccaag 192 0 

gaagaggagg aactgtcttc ccagatgtcc agctttaacg aagccatgac tcagatcagg 1980 

gagctggagg agaaggctat ggaagagctc aaggagatca tacagcaagg accagactgg 2 040 

cttgagctct ctgagatgac cgagcagcca gactatgacc tggagacctt tgtgaacaaa 2100 

gcggaatctg ctctggccca gcaagccaag catttctcag ccctgcgaga tgtcatcaag 2160 

gccttacgcc tggccatgca gctggaagacj caggctagca gacaaataag cagcaagaaa 2220 

cggccccagt gacgactgca aataaaaato tgtttggttt gacacccagc ctcttccctg 22 80 

gccctcccca gagaactttg ggtacctggt gggtctaggc agggtctgag ctgggacagg 2340 

ttctggtaaa tgccaagtat gggggcatcfc gggcccaggg cagctgggga gggggtcaga 2400 

gtgacatggg acactccttt tctgttcctc agttgtcgcc ctcacgagag gaaggagctc 2460 

ttagttaccc ttttgtgttg cccttctttc catcaagggg aatgttctca gcatagagct 2520 

ttctccgcag catcctgcct gcgtggactcj gctgctaatg gagagctccc tggggttgtc 2580 

ctggctctgg ggagagagac ggagccttteL gtacagctat ctgctggctc taaaccttct 2 640 

acgcctttgg gccgagcact gaatgtcttcr tactttaaaa aaatgtttct gagacctctt 2700 

tctactttac tgtctcccta gagtcctaga. ggatccctac tgttttctgt tttatgtgtt 27 60 

tatacattgt atgtaacaat aaagagaaaa aataaaaaaa aaaaaaaaaa aaaaaaaaaa 2 82 0 
aaaaa 2 825 

<210> 216 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_006845 
<400> 216 

aaatgtttct gagacctctt tctactttac tgtctcccta gagtcctaga ggatccctac 60 

<210> 217 
<211> 823 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_007019 
<400> 217 

aaacgcgggc gggcgggccc gcagtcctg-c agttgcagtc gtgttctccg agttcctgtc 60 

tctctgccaa cgccgcccgg atggcttccc aaaaccgcga cccagccgcc actagcgtcg 12 0 

ccgccgcccg taaaggagct gagccgagcg- ggggcgccgc ccggggtccg gtgggcaaaa 180 

ggctacagca ggagctgatg accctcatga tgtctggcga taaagggatt tctgccttcc 240 

ctgaatcaga caaccttttc aaatgggtag- ggaccatcca tggagcagct ggaacagtat 3 00 

atgaagacct gaggtataag ctctcgctag agttccccag tggctaccct tacaatgcgc 3 60 

ccacagtgaa gttcctcacg ccctgctatc accccaacgt ggacacccag ggtaacatat 42 0 

gcctggacat cctgaaggaa aagtggtctg- ccctgtatga tgtcaggacc attctgctct 480 

ccatccagag ccttctagga gaacccaaca ttgatagtcc cttgaacaca catgctgccg 540 

agctctggaa aaaccccaca gcttttaag-a. agtacctgca agaaacctac tcaaagcagg 600 

tcaccagcca ggagccctga cccaggctg-c ccagcctgtc cttgtgtcgt ctttttaatt 660 

tttccttaga tggtctgtcc tttttgtgett ttctgtatag gactctttat cttgagctgt 72 0 

ggtatttttg ttttgttttt gtcttttaa_a. ttaagcctcg gttgagccct tgtatattaa 780 
ataaatgcat ttttgtcctt ttttagaca_a. aaaaaaaaaa aaa 823 

<210> 218 
<211> 60 
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<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007019 
<400> 218 

tggaaaaacc ccacagcttt taagaagtac ctgcaagaaa cctactcaaa gcaggtcacc 60 

<210> 219 

<211> 2831 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007183 
<400> 219 

gaattccgga caggacgtga agatagttgg gtttggaggc ggccgccagg cccaggcccg 60 

gtggacctgc cgccatgcag gacggtaact tcctgctgtc ggccctgcag cctgaggccg 12 0 

gcgtgtgctc cctggcgctg ccctctgacc tgcagctgga ccgccggggc gccgaggggc 180 

cggaggccga gcggctgcgg gcagcccgcg tccaggagca ggtccgcgcc cgcctcttgc 240 

agctgggaca gcagccgcgg cacaacgggg ccgctgagcc cgagcctgag gccgagactg 3 00 

ccagaggcac atccaggggg cagtaccaca ccctgcaggc tggcttcagc tctcgctctc 3 60 

agggcctgag tggggacaag acctcgggct tccggcccat cgccaagccg gcctacagcc 42 0 

cagcctcctg gtcctcccgc tccgccgtgg atctgagctg cagtcggagg ctgagttcag 480 

cccacaatgg gggcagcgcc tttggggocg ctgggtacgg gggtgcccag cccacccctc 540 

ccatgcccac caggcccgtg tccttccatg agcgcggtgg ggttgggagc cgggccgact 600 

atgacacact ctccctgcgc tcgctgcggc tggggcccgg gggcctggac gaccgctaca 660 

gcctggtgtc tgagcagctg gagcccgcgg ccacctccac ctacagggcc tttgcgtacg 72 0 

agcgccaggc cagctccagc tccagccggg caggggggct ggactggccc gaggccactg 7 80 

aggtttcccc gagccggacc atccgtgccc ctgccgtgcg gaccctgcag cgattccaga 840 

gcagccaccg gagccgcggg gtaggcgg-gg cagtgccggg ggccgtcctg gagccagtgg 900 

ctcgagcgcc atctgtgcgc agcctcagcc tcagcctggc tgactcgggc cacctgccgg 960 

acgtgcatgg gttcaacagc tacggtagcc accgaaccct gcagagactc agcagcggtt 102 0 

ttgatgacat tgacctgccc tcagcagtca agtacctcat ggcttcagac cccaacctgc 1080 

aggtgctggg agcggcctac atccagcaca agtgctacag cgatgcagcc gccaagaagc 1140 

aggcccgcag ccttcaggcc gtgcctaggc tggtgaagct cttcaaccac gccaaccagg 12 00 

aagtgcagcg ccatgccaca ggtgccatgc gcaacctcat ctacgacaac gctgacaaca 12 60 

agctggccct ggtggaggag aacgggatct tcgagctgct gcggacactg cgggagcagg 132 0 

atgatgagct tcgcaaaaat gtcacag-g-ga tcctgtggaa cctttcatcc agcgaccacc 13 80 

tgaaggaccg cctggccaga gacacgctgg agcagctcac ggacctggtg ttgagccccc 1440 

tgtcgggggc tgggggtccc cccctcatcc agcagaacgc ctcggaggcg gagatcttct 15 00 

acaacgccac cggcttcctc aggaacctca gctcagcctc tcaggccact cgccagaaga 1560 

tgcgggagtg ccacgggctg gtggacg-occ tggtcacctc tatcaaccac gccctggacg 162 0 

cgggcaaatg cgaggacaag agcgtgg-^aga acgcggtgtg cgtcctgcgg aacctgtcct 1680 

accgcctcta cgacgagatg ccgccgtccg cgctgcagcg gctggagggt cgcggccgca 1740 

gggacctggc gggggcgccg ccgggag-eigg tcgtgggctg cttcacgccg cagagccggc 1800 

ggctgcgcga gctgcccctc gccgccg-eitg cgctcacctt cgcggaggtg tccaaggacc 1860 

ccaagggcct cgagtggctg tggagccccc agatcgtggg gctgtacaac cggctgctgc 192 0 

agcgctgcga gctcaaccgg cacacgacgg aggcggccgc cggggcgctg cagaacatca 1980 

cggcaggcga ccgcaggtgg gcggggg"tgc tgagccgcct ggccctggag caggagcgta 2 040 

ttctgaaccc cctgctagac cgtgtca^ga ccgccgacca ccaccagctg cgctcactga 2100 

ctggcctcat ccgaaacctg tctcggaacg ctaggaacaa ggacgagatg tccacgaagg 2160 

tggtgagcca cctgatcgag aagctgccag gcagcgtggg tgagaagtcg cccccagccg 222 0 

aggtgctggt caacatcata gctgtgctca acaacctggt ggtggccagc cccatcgctg 22 80 

cccgagacct gctgtatttt gacggactcc gaaagctcat cttcatcaag aagaagcggg 2340 

acagccccga cagtgagaag tcctcccggg cagcatccag cctcctggcc aacctgtggc 2400 

agtacaacaa gctccaccgt gactttCQ-gg cgaagggcta tcggaaggag gacttcctgg 2460 

gcccataggt gaagccttct ggaggagaag gtgacgtggc ccagcgtcca agggacagac 2520 

tcagctccag gctgcttggc agcccagcct ggaggagaag gctaatgacg gaggggcccc 2580 
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tcgctggggc ccctgtgtgc atctttgagg gtcctgggcc accaggaggg gcagggtctt 2640 

atagctgggg acttggcttc cgcagggcag ggggtggggc agggctcaag gctgctctgg 2700 

tgtatggggt ggtgacccag tcacattggc agaggtgggg gttggctgtg gcctggcagt 2760 

atcttgggat agccagcact gggaataaag atggccatga acagtcacaa aaaaaaaaaa 2 82 0 
aaaaggaatt c 2 831 



<210> 220 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_007183 



<400> 220 

ctggcagtat cttgggatag ccagcactgg gaataaagat ggccatgaac agtcacaaaa 60 

<210> 221 

<211> 2815 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_007267 



<400> 221 

aggaagcgga ggaaggtgaa gtaggaccga. attcctgtgc cgaagaggcc tgcagtggga 60 

gagcaggatg ggggctccgg aggtggcgcc caggctctga gctaccctag gtctgcagac 12 0 

tagcgggcat tggccagaga catggcccag ccactggcct tcatcctcga tgtccctgag 180 

accccagggg accagggcca gggccccagc ccctatgatg aaagcgaagt gcacgactcc 240 

ttccagcagc tcatccagga gcagagccag tgcacggccc aggaggggct ggagctgcag 3 00 

cagagagagc gggaggtgac aggaagtagc cagcagacac tctggcggcc cgagggcacc 3 60 

cagagcacgg ccacactccg catcctggcc agcatgccca gccgcaccat tggccgcagc 42 0 

cgaggtgcca tcatctccca gtactacaac cgcacggtgc agcttcggtg caggagcagc 480 

cggcccctgc tcgggaactt tgtccgctcc gcctggccca gcctccgcct gtacgacctg 540 

gagctggacc ccacggccct ggaggaggag- gagaagcaga gcctcctggt gaaggagttc 600 

cagagcctgg cagtggcaca gcgggaccac atgcttcgcg ggatgccctt aagcctggct 660 

gagaaacgca gcctgcgaga gaagagcagg accccgaggg ggaagtggag gggccagccg 72 0 

ggcagcggcg gggtctgctc ctgctgtggc cggctcagat atgcctgcgt gctggccttg 7 80 

cacagcctgg gcctggcgct gctctccgcc ctgcaggccc tgatgccgtg gcgctacgcc 840 

ctgaagcgca tcgggggcca gttcggctcc agcgtgctct cctacttcct ctttctcaag 900 

accctgctgg ctttcaatgc cctcctgctg ctgctgctgg tggccttcat catgggccct 960 

caggtcgcct tcccacccgc cctgccgggc cctgcccccg tctgcacagg cctggagctc 102 0 

ctcacaggcg cgggttgctt cacccacacc gtcatgtact acggccacta cagtaacgcc 1080 

acgctgaacc agccgtgtgg cagccccctg gatggcagcc agtgcacacc cagggtgggt 1140 

ggcctgccct acaacatgcc cctggcctac ctctccactg tgggcgtgag cttctttatc 12 00 

acctgcatca ccctggtgta cagcatggct cactctttcg gggagagcta ccgggtgggc 12 60 

agcacctctg gcatccacgc catcaccgtc ttctgctcct gggactacaa ggtgacgcag 132 0 

aagcgggcct cccgcctcca gcaggacaat attcgcaccc ggctgaagga gctgctggcc 13 80 

gagtggcagc tgcggcacag ccccaggagc gtgtgcggga ggctgcggca ggcggctgtg 1440 

ctggggcttg tgtggctgct gtgtctgggg- accgcgctgg gctgcgccgt ggccgtccac 1500 

gtcttctcgg agttcatgat ccagagtcca gaggctgctg gccaggaggc tgtgctgctg 156 0 

gtcctgcccc tggtggttgg cctcctcaac ctgggggccc cctacctgtg ccgtgtcctg 162 0 

gccgccctgg agccgcatga ctccccggta ctggaggtgt acgtggccat ctgcaggaac 1680 

ctcatcctca agctggccat cctggggaca ctgtgctacc actggctggg ccgcagggtg 1740 

ggcgtcctgc agggccagtg ctgggaggat tttgtgggcc aggagctgta ccggttcctg 180 0 

gtgatggact tcgtcctcat gttgctggao acgctttttg gggaactggt gtggaggatt 186 0 

atctccgaga agaagctgaa gaggaggcgg- aagccggagt ttgacattgc ccggaatgtc 192 0 

ctggagctga tttatgggca gactctgacc tggctggggg tgctcttctc gcccctcctc 1980 

cccgccgtgc agatcatcaa gctgctgctc gtcttctatg tcaagaagac cagccttctg 2040 

gccaactgcc aggcgccgcg ccggccctgg ctggcctcac acatgagcac cgtcttcctc 210 0 



100 



WO 2005/086891 



PCT/US2005/007894 



acgctgctct 
caggtgaagc 
ggcagggtgt 
gtgcaccggt 
gccgtgatct 
aaggagcaga 
atctacgaga 
gcaccccctg 
gggcccgccc 
ggacaaggaa 
gggtgggtga 
attctgatta 



gcttccccgc 
cctcgagcac 
gggtgcgcca 
acctgatgga 
acctcaacat 
tcagcaatga 
ggaaggagag 
ccctgctcac 
agcaccctga 
gacagtttcg 
atccggcttg 
aaggggtttt 



cttcctgggc 
ctgcggcccc 
cctggaggcg 
aaacaccttc 
ccaggtggtg 
ggg"tgaggac 
ggaggagagg 
agatgaacag 
gaccacactg 
cctctcgaaa 
ggcatcccca 
gcaaatggga 



gccgctgtct 
ttccggaccc 
gcaggcccca 
tttgtcttcc 
cggggccagc 
aaaatcttct 
agcagggttg 
gatgcctagg 
ttgcctccca 
gccgcagctg 
atgaactctg 
aaaaaaaaaa 



tcctctgcta 
tggacaccat 
gggtctcctg 
tggtgtcagc 
gcaaggtcat 
taatcaacaa 
ggacaaccga 
gggacggcga 
gtgaccctgc 
cgcctaggct 
ccctgcctgg 
aaaaaaaaaa 



cgccgtctgg 
gtacgaggcc 
gctgccctgg 
cctgctgctg 
ctgcctgctc 
gcttcactcc 
ggaggctgcg 
tgggcctcac 
tgggacacca 
ggagctggaa 
gactctattt 
aaaaa 2 815 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



<210> 222 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007267 



<400> 222 

ggtgaggaca aaatcttctt aatcaacaag cttcactcca tctacgagag gaaggagagg 6 0 



<210> 223 

<211> 1893 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007274 



<400> 223 

atttaccgcc 

ccagacgctt 

tccccgcacc 

ccccacgccg 

gcggggctcg 

tcgcccccgg 

tttatggcgc 

ctgcagccgc 

acgccgtccg 

aatgtccacg 

cggcattgca 

accgacttcc 

tacacctcca 

ggtgccaaaa 

gtggacaagg 

ggccggaagc 



gcgcggagag 
ccctcggggc 
ggcgcggccc 
ccagaatcgt 
gggtggggcc 
cgcgcaggcc 
ggcccgggct 
ccgccgcatc 
ccatccagat 
gggggaccat 
acagccagaa 
tgtctcccat 
agcactctgt 
agctgaccaa 
tcctcgaggt 
ggtatgaagc 



tgagggccca 
tgccaccggg 
aaccgccacc 
acgcccgcgc 
ggcgcggggg 
ccgcctcacc 
cattcattcc 
cgccgccgca 
ctcj-ccggatc 
cctgaagatg 
cgcj-crgagcgc 
gtcjcatcggt 
gga-ggtgcag 
taaggccacc 
gcctcctgtt 
ccagaagctg 



agtccgccct 
tcgggcgcgg 
gaaccttctg 
gagctctctg 
cggggtcggc 
agccccatcg 
gcgccgggcc 
gcccccagca 
atgcggccag 
atcgaggagg 
tgtgtggccg 
gaggtggcgc 
gtcaacgtga 
ctgtggtatg 
gtgtattccc 
gagcgcatgg 



gctccgccac 
ctgccgcggc 
gaagcggcgg 
cagccttggc 
gcggggaggc 
ctccacctct 
tgccagacac 
tgtcgggccc 
atgatgccaa 
caggcgccat 
ccctggctcg 
atgtcagcgc 
tgtccgaaaa 
tgcccctgtc 
ggcaggagca 
agaccaagtg 



ttaggccgcc 
tagcgggcct 
ctgcctgggc 
ggcctgggag 
cgcgttcgat 
gccctccccc 
ctgcgccctt 
agacgtcgag 
cgtggccggc 
catcagcacc 
tgtcgagcgc 
ggagatcacc 
catcctcaca 
gctgaagaat 
ggaggaggag 
gaggaacggg 



60 

3.20 

180 

240 

300 

360 

420 

480 

540 

600 

660 

-720 

-780 

S40 

900 

960 



gacatcgtcc agccagtcct caacccagag ccgaacactg tcagctacag ccagtccagc 102 0 

ttgatccacc tggtggggcc ttcagactgc accctgcacg gctttgtgca cggaggtgtg 1080 

accatgaagc tcatggatga ggtcgccggg atcgtggctg cacgccactg caagaccaac 3.140 

atcgtcacag cttccgtgga cgccattaat tttcatgaca agatcagaaa aggctgcgtc 3.2 00 

atcaccatct cgggacgcat gaccttcacg agcaataagt ccatggagat cgaggtgttg 12 60 

gtggacgccg accctgttgt ggacagctct cagaagcgct accgggccgc cagtgccttc 3.32 0 

ttcacctacg tgtcgctgag ccaggaaggc aggtcgctgc ctgtgcccca gctggtgccc 3_3 80 

gagaccgagg acgagaagaa gcgrctttgag gaaggcaaag ggcggtacct gcagatgaag 3.440 

gcgaagcgac agggccacgc ggagcctcag ccctagactc cctcctcctg ccactggtgc 3_500 

ctcgagtagc catggcaacg ggcccagtgt ccagtcactt agaagttccc cccttggcca 3_5 60 

aaaacccaat tcacattgag agctggtgtt gtctgaagtt ttcgtatcac agtgttaacc 162 0 
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tgtactctct cctgcaaacc tacacaccaa agctttattt atatcattcc agtatcaatg 1680 

ctacacagtg ttgtcccgag cgccg-ggagg cgttgggcag aaaccctcgg gaatgcttcc 1740 

gagcacgctg tagggtatgg gaagaaccca gcaccactaa taaagctgct gcttggctgg 1800 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1860 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 1893 

<210> 224 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007274 



<400> 224 

acctacacac caaagcttta tttatatcat tccagtatca atgctacaca gtgttgtccc 



60 



<210> 225 

<211> 4157 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007315 



<400> 225 

agcggggcgg 

gagtctgcgg 

cagaggcgac 

ggatcctgcg 

aacgttcgca 

ctgctccttt 

tggtacgaac 

gacagttttc 

gagcacgctg 

ctggatgatc 

aggaaaagca 

atcatttaca 

caggctcagt 

agtaaagtca 

gaagatttac 

accaatggtg 

ttaatgcttg 

actgaactta 

cagagcgcct 

actatagttg 

gaacagaaat 

cgcaccttca 

tgcatgccaa 

aagttgagac 

tttgataaag 

ggcacgcaca 

tttcggcacc 

ctcatcgtta 

ttggtaattg 

ctcccgagcg 

ctgtccttct 

tggcagtttt 

gagaagcttc 

aaggaaaata 

ctcattaaaa 



ggcgccagcg 
aggggctcgg 
ccagcgcgct 
cgcagaaaag 
ctctgtgtat 
ggttgaatcc 
ttcagcagct 
ccatggaaat 
ccaatgatgt 
aatatagtcg 
agcgtaatct 
gctgtctgaa 
cggggaatat 
gaaatgtgaa 
aagatgaata 
tggcaaagag 
acaataagag 
cccagaatgc 
gtattggggg 
cggagagtct 
acacctacga 
gtcttttcca 
cgcaccctca 
tgttggtgaa 
atgtgaatga 
caaaagtgat 
tgcaattgaa 
ctgaagagct 
acctcgagac 
gttgggcctc 
tcctgactcc 
cttctgtcac 
ttggtcctaa 
taaatgataa 
aacacctgct 



ctgccttttc 
ctgcaccggg 
cgggagaggc 
tttca_tttgc 
ataacctcga 
ccaggccctt 
tgactcaaaa 
cagacagtac 
ttcatttgcc 
cttttctttg 
tcaggj-ataat 
ggaagaaagg 
tcagagcaca 
ggacaaggtt 
tgacttcaaa 
tgatcagaaa 
aaagcj-aagta 
cctgattaat 
gccgcccaat 
gcagcaagtt 
acatgraccct 
gcagctcatt 
gaggccgctg 
attgcaagag 
gagaaataca 
gaacatggag 
agaac agaaa 
tcactccctt 
gacctctctg 
catcctttgg 
accatgtgca 
caaaagaggt 
cgccagcccc 
aaattttccc 
ccctctctgg 



tcctgccggg 
gggatcgcgc 
tgcaccgccg 
tgtatgccat 
cagtcttggc 
gttggggcac 
ttcctggagc 
ctggcacagt 
accatccgtt 
gagaataact 
tttcaggaag 
aaaattctgg 
gtgatgttag 
atgtgtatag 
tgcaaaacct 
caagaacagc 
gttcacaaaa 
gatgaactag 
gcttgcttgg 
cggcagcagc 
atcacaaaaa 
cagagctcgt 
gtcttgaaga 
ctgaattata 
gtaaaaggat 
gagtccacca 
aatgctggca 
agttttgaaa 
cccgttgtgg 
tacaacatgc 
cgatgggctc 
ctcaatgtgg 
gatggtctca 
ttctggcttt 
aatgatgggt 



tagtttcgct 
ctggcagacc 
cgcccccgcc 
cctcgagagc 
acctaacgtg 
aaggtggcag 
aggttcacca 
ggttagaaaa 
ttcatgacct 
tcttgctaca 
acccaatcca 
aaaacgccca 
acaaacagaa 
agcatgaaat 
tgcagaacag 
tgttactcaa 
taatagagtt 
tggagtggaa 
atcagctgca 
ttaaaaagtt 
acaaacaagt 
ttgtggtgga 
caggggtcca 
atttgaaagt 
ttaggaagtt 
atggcagtct 
ccagaacgaa 
cccaattgtg 
tgatctccaa 
tggtggcgga 
agctttcaga 
accagctgaa 
ttccgtggac 
ggattgaaag 
gcatcatggg 



ttcctgcgca 
ccagaccgag 
tagcccttcc 
tgtctaggtt 
ctgtgcgtag 
gatgtctcag 
gctttatgat 
gcaagactgg 
cctgtcacag 
gcataacata 
gatgtctatg 
gagatttaat 
agagcttgac 
caagagcctg 
agaacacgag 
gaagatgtat 
gctgaatgtc 
gcggagacag 
gaactggttc 
ggaggaattg 
gttatgggac 
aagacagccc 
gttcactgtg 
caaagtctta 
caacattttg 
ggcggctgaa 
tgagggtcct 
ccagcctggt 
cgtcagccag 
acccaggaat 
agtgctgagt 
catgttggga 
gaggttttgt 
catcctagaa 
cttcatcagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 



102 



WO 2005/086891 



PCT/US2005/007894 



aaggagcgag agcgtgccct gttgaaggac cagcagccgg ggaccttcct gctgcggttc 2160 

agtgagagct cccgggaagg ggccatcaca ttcacatggg tggagcggtc ccagaacgga 2220 

ggcgaacctg acttccatgc ggttgaaccc tacacgaaga aagaactttc tgctgttact 22 80 

ttccctgaca tcattcgcaa ttacaaagtc atggctgctg agaatattcc tgagaatccc 2 340 

ctgaagtatc tgtatccaaa tattgacaaa gaccatgcct ttggaaagta ttactccagg 2 400 

ccaaaggaag caccagagcc aatggaactt gatggcccta aaggaactgg atatatcaag 2460 

actgagttga tttctgtgtc tgaagttcac ccttctagac ttcagaccac agacaacctg 2520 

ctccccatgt ctcctgagga gtttgacgag gtgtctcgga tagtgggctc tgtagaattc 2 5 80 

gacagtatga tgaacacagt atagagcatg aatttttttc atcttctctg gcgacagttt 2 640 

tccttctcat ctgtgattcc ctcctgctac tctgttcctt cacatcctgt gtttctaggg 2700 

aaatgaaaga aaggccagca aattcgctgc aacctgttga tagcaagtga atttttctct 27 60 

aactcagaaa catcagttac tctgaagggc atcatgcatc ttactgaagg taaaattgaa 2 82 0 

aggcattctc tgaagagtgg gtttcacaag tgaaaaacat ccagatacac ccaaagtatc 2 880 

aggacgagaa tgagggtcct ttgggaaagg agaagttaag caacatctag caaatgttat 2 940 

gcataaagtc agtgcccaac tgttataggt tgttggataa atcagtggtt atttagggaa 3 0 00 

ctgcttgacg taggaacggt aaatttctgt gggagaattc ttacatgttt tctttgcttt 3 0 60 

aagtgtaact ggcagttttc cattggttta cctgtgaaat agttcaaagc caagtttata 312 0 

tacaattata tcagtcctct ttcaaaggta gccatcatgg atctggtagg gggaaaatgt 3180 

gtattttatt acatctttca cattggctat ttaaagacaa agacaaattc tgtttcttga 3240 

gaagagaata ttagctttac tgtttgttat ggcttaatga cactagctaa tatcaataga 33 00 

aggatgtaca tttccaaatt cacaagttgt gtttgatatc caaagctgaa tacattctgc 33 60 

tttcatcttg gtcacataca attattttta cagttctccc aagggagtta ggctattcac 3420 

aaccactcat tcaaaagttg aaattaacca tagatgtaga taaactcaga aatttaattc 3 480 

atgtttctta aatgggctac tttgtccttt ttgttattag ggtggtattt agtctattag 3 540 

ccacaaaatt gggaaaggag tagaaaaagc agtaactgac aacttgaata atacaccaga 3 6 00 

gataatatga gaatcagatc atttcaaaac tcatttccta tgtaactgca ttgagaactg 3 6 60 

catatgtttc gctgatatat gtgtttttca catttgcgaa tggttccatt ctctctcctg 3720 

tactttttcc agacactttt tfc^agtggat gatgtttcgt gaagtatact gtatttttac 3 7 80 

ctttttcctt ccttatcact gacacaaaaa gtagattaag agatgggttt gacaaggttc 3 840 

ttccctttta catactgctg tctatgtggc tgtatcttgt ttttccacta ctgctaccac 3 9 00 

aactatatta tcatgcaaat gctgtattct tctttggtgg agataaagat ttcttgagtt 3 9 60 

ttgttttaaa attaaagcta aagtatctgt attgcattaa atataatatg cacacagtgc 4020 

tttccgtggc actgcataca atctgaggcc tcctctctca gtttttatat agatggcgag 4080 

aacctaagtt tcagttgatt ttacaattga aatgactaaa aaacaaagaa gacaacatta 4140 
aaacaatatt gtttcta 4157 



<210> 226 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_007315 



<400> 226 

atcagatcat ttcaaaactc atttcctatg taactgcatt gagaactgca tatgtttcgc 6 0 



<210> 227 

<211> 1696 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_009587 



<400> 227 

caaaggactt cctagtgggt gtgaaaggca 
ccttcagcgg ttcccaggct ccctacctga 
aaggaggtct ccaggacgga cttcagatca 
gaaccaggtt tgctgtgaac tttcagactg 
tcaaccctcg gtttgaagat gg-agggtacg 



gcggtggcca cagaggcggc ggagagatgg 60 

gtccagctgt ccccttttct gggactattc 12 0 

ctgtcaatgg gaccgttctc agctccagtg 180 

gcttcagtgg aaatgacatt gccttccact 240 

tggtgtgcaa cacgaggcag aacggaagct 3 00 
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gggggcccga 
gcttcctggt 
acttccaccg 
tgtcctacat 
tgccgttctc 
ccggcgtgtg 
gcgcccctgg 
cctatccgat 
tcctgtcagg 
accacatcgc 
agatcgacaa 
gtggccagag 
atggtcagca 
tggaagtggg 
ggggccgggg 
ccagcctttc 
gtcctgcttc 
ctcagccgca 
cacagccagg 
ccatccccca 
gtggcctcct 
caacccttca 
accagagtgt 
aaaatgcttg 



ggagaggaag 
gcagagctca 
cgtgcccttc 
cagcttccag 
ccagcctgtc 
gcctgccaac 
acagatgttc 
gcctttcatc 
cactgtcctg 
cttccacctg 
ctcctggggg 
cttctcagtg 
cctgtttgaa 
gggcgacatc 
gctggggtgt 
caaccctgcc 
tggctacagc 
gcagcacctg 
gagaggggag 
cgcagctcca 
cagcccctcc 
cccctcctgg 
tctcttcaga 
ttggca 169 



acacacatgc 
gatttcaagg 
caccgtgtgg 
aacccccgca 
tgtttcccac 
ccggctccca 
fcctactcccg 
accaccattc 
cccagtgctc 
aacccccgtt 
tctgaggagc 
fcggatcttgt 
tactaccatc 
cagctgaccc 
g-gggcagtct 
fcgggatctgg 
caccctggaa 
g-ggctccagc 
g-agtgggcag 
ccccagtccc 
tctctgacct 
aaagcaggcc 
grgactggctc 
S 



ctttccagaa 
tgatggtgaa 
acaccatctc 
cagtccctgt 
ccaggcccag 
ttacccagac 
ccatcccacc 
tgggagggct 
agaggttcca 
ttgatgagaa 
gaagtctgcc 
gtgaagctca 
gcctgaggaa 
atgtgcagac 
gggtcctctc 
gctttaatgc 
cggagaaggc 
tgctggaatc 
tgaagatgaa 
aagccaccag 
ttaacctcac 
tgatggcttc 
ctttcccagt 



ggggatgccc 

cgggatcctc 
cgtcaatggc 
tcagcctgcc 

ggggcgcaga 

agtcatccac 
tatgatgtac 
gtacccatcc 
catcaacctg 
tgctgtggtc 
ccgaaaaatg 
ctgcctcaag 
cctgcccacc 
ataggcggct 
atcatcccca 
agaggccatg 
agctgacggg 
ctaccatccc 
gccccatgct 
ctgtctgctc 
tctcaccttg 
ccactggcct 
gtccttaaaa 



tttgacctct 
ttcgtgcagt 
tctgtgcagc 
ttctccacgg 
caaaaacctc 
acagtgcaga 
ccccaccccg 
aagtccatcc 
tgcfcctggga 
cgcaacaccc 
cccttcgtcc 
gtggccgtgg 
atcaacagac 
tcctggccct 
cttcccaggc 
tccttgtctg 
gattgccttc 
aggaggcagg 
cagtcccctc 
ctggtgggag 
caccgtgcac 
ccaccacctg 
taaagaaatg 



3 60 
420 

4 80 

5 40 

5 00 

6 60 
720 

7 80 
840 
9 00 
9 60 
1 020 
1 080 
1 140 
L200 
1260 
1320 
1380 
1440 
1500 
1 560 
1 620 
1 680 



<210> 228 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_009587 



<400> 228 

cagaggactg gctcctttcc cagtgtcctt aaaataaaga aatgaaaatg cttgttggca 



S 0 



<210> 229 
<211> 6552 
<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_012291 



<400> 229 

atgaggagct 

gagttgctgc 

cgatctgatg 

ctgactgcta 

gcctgtgatg 

ctctttgtct 

gctcagcccc 

gaggccgtgg 

cggcgagctg 

gaaagtaccc 

gcccatcagc 

gatgacctgc 

gggcttcttt 

cgcttttgct 

tacctaagga 



tcaaaagagt 
ccgacttgaa 
ctgagaggag 
agctagcttg 
gctacttagt 
tactgcggaa 
tccatgcctg 
ctcggggcag 
catttgcagc 
cttgtgaggt 
tatttgatgc 
tctccaggca 
ctccccagag 
ggagccgcca 
acaccaatct 



caactttggg 
g-gagttcctg 
acaagcttgt 
ccctaggcat 
gtctacccca 
tgctgctgca 
ottggtgcag 
cttttctctg 
tcggctgaag 
tcctcacttt 
cagtggccat 
cgtgatcaga 
ggccctctgc 
ccatgacaaa 
agcccctagc 



actctgctaa 
tccaaccctc 
gatgccatcc 
ctggggagcc 
cagcgtcctc 
caaggaagcc 
tgctctcgcg 
ctttggaagg 
gccttgagct 
gcttctccaa 
ggtctaaatg 
gccttggtgg 
ctcttggagc 
gccatcagcg 
cttcagctat 



gcagccagaa 
cagctggttt 
tgagggcttg 
tgctggagct 
ccctctacct 
cagaggccac 
aggctgctcc 
gggcagaagc 
tcctagtact 
cagcctgtcg 
aagcagatgc 
gtgagagagg 
tcaccttgga 
cagtggagaa 
gtcagctggg 



ggaggctgaa 
tcccagcagc 
caaccagcag 
ggcagagctg 
ggaacgaatt 
actccgcctt 
ccaggactat 
cctgttggaa 
cttggaggat 
agcggtagct 
tgatttccta 
gagctcttct 
acactgccgt 
ggctcacagt 
ggttaagctg 



5 0 
120 
180 

2 40 

3 00 
3 60 
420 
480 
540 

6 00 
660 
720 

7 80 
840 
9 00 
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ctgcaggttg 
gtcctgagca 
cagttcttcc 
ctgagcctct 
ggtgtgtatg 
cttcacctct 
gctgacctga 
ttagagggcc 
agtaatttgg 
gagccgctct 
gagaagttgc 
gcccagggct 
cacatggctg 
gacaaggagc 
ctggccctcc 
caggaacgct 
ggggcctggg 
tttacgcagc 
gactctgtga 
ttgctgtggc 
cggagagccc 
gataaactcc 
gctgctcagt 
gggcaggccc 
gccctctacc 
attgtctctg 
cagctcctcc 
tcgagcctga 
gatctgcttc 
ctgctgtctg 



gggaggaagg 
agagtatgga 
tttcaggcct 
ttgcttttct 
ggggctcctc 
acactgtggt 
cccaactagt 
tgtcgggcca 
cctacagctt 
gtcagcacct 
acaggtgctt 
gcaagatggt 
agccagtcac 
tacagctaaa 
tgctgaggga 
tcaacatcat 
cacgagccac 
agaccaactg 
ggcctgaggc 
tttacatctg 
aggcccctgg 
aggaagatcg 
ccaaatgcct 
cagctgtacg 
agctggtggc 
agagactgaa 
tgaccctcgg 
agcatctcga 
gaagtcaact 
tgcttcggga 



acctcaggca 
ggcaccatca 
ggaacgaggc 

tgg-a.gggtac 

caagcaacag 
ggtttatgac 
ggacagttgt 
agagctgacg 
ctatagtcac 
ggg-tttggtg 
ccggctacaa 
gattttgtgg 
ttt ctgggtt 
gactctgcga 
ggagctgcag 
ctgtgacctc 
ccacctggta 
ctctgctctg 
ccaggccaga 
tactctggaa 
taacttggag 
ttt cctatac 
ggaccaagcc 
gtgtctccag 
aaagcccatg 
ggaccactcg 
ctgtcccagc 
tcagactact 
ctactggact 
tcctgccctc 



gtggccaagc 
cccccacttc 
accaagaggc 
tgctctcttc 
cagtcttttc 
tttgcccaag 
aaatctaccg 
gaccacatgg 
aagctctatg 
aagccaggca 
gtagagagtt 
ctggcagccc 
cgggtcaaga 
gacagcctca 
gcctacaagg 
ctggagctga 
gaactggctc 
gatgctatcc 
gatcagcttc 
gccaaaatac 
gaatttgaag 
agtaacattg 
ctggccctgt 
cagacagcag 
caggctctgg 
aaggcagctg 
tatgcccagt 
gacacatacc 
caccagaagg 
cagaagtcct 



ttctgatcaa 
gggcattgta 
gctatagact 
tgcagcagct 
ttcagatgta 
gctgtcagat 
ttgtctggat 
ggatgaccgc 
ccgaggcctg 
cttatcccga 
tgaagaaact 
tgcaaccctg 
tggatgcggc 
gtggctggga 
cggtgcgggc 
gccccgagga 
aggtgctctg 
gggaagccct 
tggacgataa 
aggaaggtat 
tcaatgacct 
ccttcaacct 
ggaaggagct 
cctcactgca 
aggtcctcct 
gctcctcctg 
tacacctgga 
tgctcctttc 
tgaccaaggg 
ccaaggcttg 



ggcatcagct 
tgagagctgc 
tgatgccatfc. 
gcgggatgat 
ctttcagggai_ 
agttgatttg- 
gctggaggco 
ttcttacacc 
tgccatctct 
ggtgcctcct 
gggtaaacac^* 
tagccctgaa 
cagggctgga. 
cccggagacc 
cgacactgga 
gacaccagcc 
ctaccacgac 
gcagcttctg- 
agcacaggcc 
cgagcgggat 
gaactatgaa 
ggctgcagat 
gcttacaaag 
gatcctagca 
gctgctacgg" 
ccacatcacc 
agaggcagca 
cctgacctgt 
tgtctctctg- 
gtacttgctcj" 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 



cgtgtccagg 
cactccctgt 
gactcccata 
actcagaaag 
tggcaggttc 
ctgggtagtg 
ctgcagatac 
cgcaatgaca 
tgcacagagt 
Sia.gggga.Eigc 
ctaagaggcc 
ccttctacaa 
tcccattcac 
ctgcgctggg 
ctggatctgc 
gctctccaag 
gagatcttgg 
gagagcctgc 
ctagagccct 
agctgctgta 
agtgtccctg 
caaaagttag 
agaggactgc 
cccttcacgg 
cccagggtac 
ttggaagacc 
gcttcccggg 
gccccaggta 



tcctgcagct 
gggagcagct 
agctcctccg 
cagctgtgga 
tttcagaggt 
tgagtgaagc 
cacgccagtg 
ttgatctctg 
ttggtggggt 
agcaggccca 
ctgctctaga 
actcctcccc 
ccacctgtga 
tattggtcac 
tgcaggtcgt 
cttccctgaa 
ctcaagcata 
agaaggttct 
ggcgagccag 
ctacccaact 
gctcagagcc 
cctctgctcc 
cctgcacacc 
tgtttgagga 
aacagagagt 
ctgtctcagc 
gccgggggcg 
gtgcccctgg 



ggt^gcagct 
ctgtgcccaa 
aagcatcatc 
gacatcgttt 
gctgagctgc 
caaggccttt 
tgccctgttc 
tcagtcggac 
gactcagcac 
ggtcccctgt 
gctggtggcc 
agtcttgaaa 
ctgctcgctc 
ggc<aggggtg 
gctgaagggc 
tcataaaaca 
cacactgttg 
acagtcaggg 
cctgctcttg 
ttttgcaagc 
ctctaagact 
cctgcgcctc 
taaaccccca 
agtctgccct 
ccagacgcgc 
tga.ggcctgg 
agcaaggaag 
gaaccctggc 



taccttagcc 
ggctggcaga 
ctcctgctga 
ttggactatg 
tcagagaagc 
tgcttggagg 
ctggtgctga 
ctgcagcagg 
ctggactctg 
cctccacagc 
actgtggcca 
accaagcccc 
tgcgccagcc 
aggctggcca 
tgtcctgaag 
cccccctcct 
gcactggagg 
ctgaagtttg 
atttgggccc 
tcctggggct 
cagggccaaa 
aataatacct 
gaccggatca 
acagagagca 
ctcaaggtga 
ctggcagagg 
ggcctgagcc 
ctgaatggca 



tcccgtcaaa 
cacctgagat 
tgggcagtga 
gtgaaaatct 
tggtctgcca 
ccctaaaact 
agggcgagct 
ttctgttctt 
tgaagaaggt 
tcccagagga 
aggagcctgg 
agcccatacc 
ctgtcctcac 
tgggccacca 
ccgctgagcg 
tggttccaag 
gcctgaacca 
tagcagcacg 
tcacaaaact 
ggcagccacc 
aacgttctgg 
ctcagaaagg 
ggcaagctgg 
agcctgaagt 
acttcagtga 
agcctaagag 
taaagacgga 
ggagccggag 



caacctctca 
agctctcata 
cattctctca 
ggtacaaaaa 
cctgggccgc 
tacaacaaag 
ggagctggcc 
gcttgagtct 
ccacctgcag 
ggagctctto 
ccccatagca 
caacttcctg 
agcagtctgt 
agcccagggt 
cctcacccaa 
cctcttggat 
gccatcaaac 
gataccccac 
aggtggcctc 
attaataaaa 
acgagggcgc 
tctggaaggt 
ccctcatgtc 
accccaggcc 
tgacagtgac 
acggggcact 
tgccgtggtt 
ggccaagaag - 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 



gtggcatcaa gacattgtga ggag-cggcgt ccccagaggg ccagtgacca ggccaggcct 4440 
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ggccctgaga tcatgaggac catccctgag gaagaactga ctgacaactg gagaaaaatg 4500 

agctttgaga tcctcagggg ctctgacggg gaagactcag cctcaggtgg gaagactcca 4560 

gctccgggcc ctgaggcagc ttctggagaa tgggagctgc tgaggctgga ttccagcaag 462 0 

aagaagctgc ccagcccatg cccagacaag gagagtgaca aggaccttgg tcctcggctc 4680 

cagctcccct cagcccccgt agccactggt ctttctaccc tggactccat ctgtgactcc 4740 

ctgagtgttg ctttccgggg cattagtcac tgtcctccta gtgggctcta tgcccacctc 4800 

tgccgcttcc tggccttgtg cctgggccac cgggatcctt atgccactgc tttccttgtc 4860 

accgagtctg tctccatcac ctgtcgccac cagctgctca cccacctcca cagacagctc 492 0 

agcaaggccc agaagcaccg aggatcactt gaaatagcag accagctgca ggggctgagc 4980 

cttcaggaga tgcctggaga tgtccccctg gcccgcatcc agcgcctctt ttccttcagg 5 040 

gctttggaat ctggccactt cccccagcct gaaaaggaga gtttccagga gcgcctggct 5100 

ctgatcccca gtggggtgac tg-fcgtgtgtg ttggccctgg ccaccctcca gcccggaacc 5160 

gtgggcaaca ccctcctgct gacccggctg gaaaaggaca gtcccccagt cagtgtgcag 522 0 

attcccactg gccagaacaa gcttcatctg cgttcagtcc tgaatgagtt tgatgccatc 52 80 

cagaaggcac agaaagagaa cagcagctgt actgacaagc gagaatggtg gacagggcgg 5340 

ctggcactgg accacaggat ggciggttctc atcgcttccc tagagaagtc tgtgctgggc 5400 

tgctggaagg ggctgctgct gccgtccagt gaggagcccg gccctgccca ggaggcctcc 5460 

cgcctacagg agctgctaca ggeictgtggc tggaaatatc ctgaccgcac tctgctgaaa 5520 

atcatgctca gtggtgccgg tgccctcacc cctcaggaca ttcaggccct ggcctacggg 5580 

ctgtgcccaa cccagccaga gcgagcccag gagctcctga atgaggcagt aggacgtcta 5640 

cagggcctga cagtaccaag caatagccac cttgtcttgg tcctagacaa ggacttgcag 57 00 

aagctgccgt gggaaagcat gcccagcctc caagcactgc ctgtcacccg gctgccctcc 5760 

ttccgcttcc tactcagcta ctccatcatc aaagagtatg gggcctcgcc agtgctgagt 582 0 

caaggggtgg atccacgaag taccttctat gtcctgaacc ctcacaataa cctgtcaagc 5 880 

acagaggagc aatttcgagc caatttcagc agtgaagctg gctggagagg agtggttggg 5940 

gaggtgccaa gacctgaaca ggtgcaggaa gccctgacaa agcatgattt gtatatctat 6000 

gcagggcatg gggctggtgc ccgcttcctt gatgggcagg ctgtcctgcg gctgagctgt 6060 

cgggcagtgg ccctgctgtt tg^ctgtagc agtgcggccc tggctgtgca tggaaacctg 612 0 

gagggggctg gcatcgtgct caagtacatc atggctggtt gccccttgtt tctgggtaat 6180 

ctctgggatg tgactgaccg cgacattgac cgctacacgg aagctctgct gcaaggctgg 6240 

cttggagcag gcccaggggc cccccttctc tactatgtaa accaggcccg ccaagctccc 63 00 

cgactcaagt atcttattgg ggctgcacct atagcctatg gcttgcctgt ctctctgcgg 6360 

taaccccatg gagctgtctt attgatgcta gaagcctcat aactgttcta cctccaaggt 642 0 

tagatttaat ccttaggata actcttttaa agtgattttc cccagtgttt tatatgaaac 6480 

atttcctttt gatttaacct cagtataata aagatacatc atttaaaccc tgaaaaaaaa 6540 
aaaaaaaaaa aa 6552 

<210> 230 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_012291 
<400> 230 

agcctcataa ctgttctacc tccaaggtta gatttaatcc ttaggataac tcttttaaag 60 



<210> 231 

<211> 6317 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_013261 
<400> 231 

tagtaagaca ggtgccttca gttcactctc agtaaggggc tggttgcctg catgagtgtg 60 
tgctctgtgt cactgtggat tgg-agttgaa aaagcttgac tggcgtcatt caggagctgg 12 0 
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atggcgtggg acatgtgcaa ccaggactct gagtctgtat ggagtgacat cgagtgtgct 180 

gctctggttg gtgaagacca gcctctttgc ccagatcttc ctgaacttga tctttctgaa 240 

ctagatgtga acgacttgga tacagacagc tttctgggtg gactcaagtg gtgcagtgac 3 00 

caatcagaaa taatatccaa tcagtacaac aatgagcctt caaacatatt tgagaagata 3 60 

gatgaagaga atgaggcaaa cttgctagca gtcctcacag agacactaga cagtctccct 420 

gtggatgaag acggattgcc ctcatttgat gcgctgacag atggagacgt gaccactgac 480 

aatgaggcta gtccttcctc catgcctgac ggcacccctc caccccagga ggcagaagag 540 

ccgtctctac ttaagaagct cttactggca ccagccaaca ctcagctaag ttataatgaa 600 

tgcagtggtc tcagtaccca gaaccatgca aatcacaatc acaggatcag aacaaaccct 660 

gcaattgtta agactgagaa ttcatggagc aataaagcga agagtatttg tcaacagcaa 72 0 

aagccacaaa gacgtccctg ctcggagctt ctcaaatatc tgaccacaaa cgatgaccct 7 80 

cctcacacca aacccacaga gaacagaaac agcagcagag acaaatgcac ctccaaaaag 840 

aagtcccaca cacagtcgca gtcacaacac ttacaagcca aaccaacaac tttatctctt 900 

cctctgaccc cagagtcacc aaatgacccc aagggttccc catttgagaa caagactatt 9 60 

gaacgcacct taagtgtgga actctctgga actgcaggcc taactccacc caccactcct 102 0 

cctcataaag ccaaccaaga taaccctttt agggcttctc caaagctgaa gtcctcttgc 1080 

aagactgtgg tgccaccacc atcaaagaag cccaggtaca gtgagtcttc tggtacacaa 1140 

ggcaataact ccaccaagaa agggccggag caatccgagt tgtatgcaca actcagcaag 12 00 

tcctcagtcc tcactggtgg acacgaggaa aggaagacca agcggcccag tctgcggctg 1260 

tttggtgacc atgactattg ccagtcaatt aattccaaaa cagaaatact cattaatata 132 0 

tcacaggagc tccaagactc tagacaacta gaaaataaag atgtctcctc tgattggcag 13 80 

gggcagattt gttcttccac agattcagac cagtgctacc tgagagagac tttggaggca 1440 

agcaagcagg tctctccttg cagcacaaga aaacagctcc aagaccagga aatccgagcc 15 00 

gagctgaaca agcacttcgg tcatcccagt caagctgttt ttgacgacga agcagacaag 1560 

accggtgaac tgagggacag tg-atttcagt aatgaacaat tctccaaact acctatgttt 1620 



ataaattcag gactagccat gg-atggcctg tttgatgaca gcgaagatga aagtgataaa 1680 

ctgagctacc cttgggatgg cacgcaatcc tattcattgt tcaatgtgtc tccttcttgt 1740 

tcttctttta actctccatg tagagattct gtgtcaccac ccaaatcctt attttctcaa 1800 

agaccccaaa ggatgcgctc tcgttcaagg tccttttctc gacacaggtc gtgttcccga 1860 

tcaccatatt ccaggtcaag atcaaggtct ccaggcagta gatcctcttc aagatcctgc 192 0 

tattactatg agtcaagcca ctacagacac cgcacgcacc gaaattctcc cttgtatgtg 19 80 

agatcacgtt caagatcgcc ctacagccgt cggcccaggt atgacagcta cgaggaatat 2 040 

cagcacgaga ggctgaagag ggaagaatat cgcagagagt atgagaagcg agagtctgag 2100 

agggccaagc aaagggagag gcagaggcag aaggcaattg aagagcgccg tgtgatttat 2160 

gtcggtaaaa tcagacctga cacaacacgg acagaactga gggaccgttt tgaagttttt 2220 

ggtgaaattg aggagtgcac agtaaatctg cgggatgatg gagacagcta tggtttcatt 22 80 

acctaccgtt atacctgtga tg-cttttgct gctcttgaaa atggatacac tttgcgcagg 2340 

tcaaacgaaa ctgactttga gctgtacttt tgtggacgca agcaattttt caagtctaac 2400 

tatgcagacc tagattcaaa ctcagatgac tttgaccctg cttccaccaa gagcaagtat 2460 

gactctctgg attttgatag tttactgaaa gaagctcaga gaagcttgcg caggtaacat 2 52 0 

gttccctagc tgaggatgac ag-a.gggatgg cgaatacctc atgggacagc gcgtccttcc 2 580 

ctaaagacta ttgcaagtca tacttaggaa tttctcctac tttacactct ctgtacaaaa 2 640 

acaaaacaaa acaacaacaa tacaacaaga acaacaacaa caataacaac aatggtttac 27 00 

atgaacacag ctgctgaaga ggcaagagac agaatgatat ccagtaagca catgtttatt 2760 

catgggtgtc agctttgctt ttcctggagt ctcttggtga tggagtgtgc gtgtgtgcat 2 82 0 

gtatgtgtgt gtgtatgtat gtgtgtggtg tgtgtgcttg gtttagggga agtatgtgtg 2 880 

ggtacatgtg aggactgggg gcacctgacc agaatgcgca agggcaaacc atttcaaatg 2940 

gcagcagttc catgaagaca cgcttaaaac ctagaacttc aaaatgttcg tattctattc 3 000 

aaaaggaaat atatatatat atatatatat atatatatat atatataaat taaaaaggaa 3 060 

agaaaactaa caaccaacca accaaccaac caaccacaaa ccaccctaaa atgacagccg 312 0 

ctgatgtctg ggcatcagcc tttgtactct gtttttttaa gaaagtgcag aatcaacttg 3180 

aagcaagctt tctctcataa cgtaatgatt atatgacaat cctgaagaaa ccacaggttc 3240 

catagaacta atatcctgtc tctctctctc tctctctctc tctctttttt ttttcttttt 3300 

ccttttgcca tggaatctgg gtg-ggagagg atactgcggg caccagaatg ctaaagtttc 33 60 

ctaacatttt gaagtttctg tagttcatcc ttaatcctga cacccatgta aatgtccaaa 342 0 

atgttgatct tccactgcaa atttcaaaag ccttgtcaat ggtcaagcgt gcagcttgtt 3480 

cagcggttct ttctgaggag cggacaccgg gttacattac taatgagagt tgggtagaac 3540 

tctctgagat gtgttcagat agtgtaattg ctacattctc tgatgtagtt aagtatttac 3 600 
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agatgttaaa tggagtattt ttSLttttatg tatatactat acaacaatgt tcttttttgt 3660 

tacagctatg cactgtaaat gcagccttct tttcaaaact gctaaatttt tcttaatcaa 372 0 

gaatattcaa atgtaattat gaggtgaaac aattattgta cactaacata tttagaagct 3780 

gaacttactg cttatatata tttgattgta aaaacaaaaa gacagtgtgt gtgtctgttg 3 840 

agtgcaacaa gagcaaaatg atgctttccg cacatccatc ccttaggtga gcttcaatct 3 90 0 

aagcatcttg tcaagaaata tcctagtccc ctaaaggtat taaccacttc tgcgatattt 396 0 

ttccacattt tcttgtcgct tgtttttctt tgaagtttta tacactggat ttgttagggg 402 0 

aatgaaattt tctcatctaa aatttttcta gaagatatca tgattttatg taaagtctct 4080 

caatgggtaa ccattaagaa atcjtttttat tttctctatc aacagtagtt ttgaaactag 4140 

aagtcaaaaa tctttttaaa atcjctgtttt gttttaattt ttgtgatttt aatttgatac 42 00 

aaaatgctga ggtaataatt atagtatgat ttttacaata attaatgtgt gtctgaagac 42 60 

tatctttgaa gccagtattt ctttcccttg gcagagtatg acgatggtat ttatctgtat 432 0 

tttttacagt tatgcatcct gtataaatac tgatatttca ttcctttgtt tactaaagag 43 80 

acatatttat cagttgcaga tagcctattt attataaatt atgagatgat gaaaataata 4440 

aagccagtgg aaattttcta cctaggatgc atgacaattg tcaggttgga gtgtaagtgc 45 00 

ttcatttggg aaattcagct tttgcagaag cagtgtttct acttgcacta gcatggcctc 45 60 

tgacgtgacc atggtgttgt tcfctgatgac attgcttctg ctaaatttaa taaaaacttc 4620 

agaaaaacct ccattttgat catcaggatt tcatctgagt gtggagtccc tggaatggaa 4680 

ttcagtaaca tttggagtgt gtattcaagt ttctaaattg agattcgatt actgtttggc 4740 

tgacatgact tttctggaag acatgataca cctactactc aattgttctt ttcctttctc 4800 

tcgcccaaca cgatcttgta agatggattt cacccccagg ccaatgcagc taattttgat 4860 

agctgcattc atttatcacc agcatattgt gttctgagtg aatccactgt ttgtcctgtc 492 0 

ggatgcttgc ttgatttttt ggcttcttat ttctaagtag atagaaagca ataaaaatac 4980 

tatgaaatga aagaacttgt tcacaggttc tgcgttacaa cagtaacaca tctttaatcc 5040 

gcctaattct tgttgttctg taggttaaat gcaggtattt taactgtgtg aacgccaaac 5100 

taaagtttac agtctttctt tctgaatttt gagtatcttc tgttgtagaa taataataaa 5160 

aagactatta agagcaataa attattttta agaaatcgag atttagtaaa tcctattatg 5220 

tgttcaagga ccacatgtgt tctctatttt gcctttaaat ttttgtgaac caattttaaa 52 80 

tacattctcc tttttgccct ggeittgttga catgagtgga atacttggtt tcttttctta 5340 

cttatcaaaa gacagcacta cagatatcat attgaggatt aatttatccc ccctaccccc 540 0 

agcctgacaa atattgttac catgaagata gttttcctca atggacttca aattgcatct 5460 

agaattagtg gagcttttgt atcttctgca gacactgtgg gtagcccatc aaaatgtaag 552 0 

ctgtgctcct ctcattttta tttttatttt tttgggagag aatatttcaa atgaacacgt 5580 

gcaccccatc atcactggag gcaaatttca gcatagatct gtaggatttt tagaagaccg 5640 

tgggccattg ccttcatgcc gtggtaagta ccacatctac aattttggta accgaactgg 57 00 

tgctttagta atgtggattt ttttcttttt taaaagagat gtagcagaat aattcttcca 57 60 

gtgcaacaaa atcaattttt tgctaaacga ctccgagaac aacagttggg ctgtcaacat 5 82 0 

tcaaagcagc agagagggaa ctttgcacta ttggggtatg atgtttgggt cagttgataa 5880 

aaggaaacct tttcatgcct ttagatgtga gcttccagta ggtaatgatt atgtgtcctt 5940 

tcttgatggc tgtaatgaga acttcaatca ctgtagtcta agacctgatc tatagatgac 6000 

ctagaatagc catgtactat aatgtgatga ttctaaattt gtacctatgt gacagacatt 6060 

ttcaataatg tgaactgctg atttgatgga gctactttaa gatttgtagg tgaaagtgta 612 0 

atactgttgg ttgaactatg ctgaagaggg aaagtgagcg attagttgag cccttgccgg 6180 

gccttttttc cacctgccaa ttctacatgt attgttgtgg ttttattcat tgtatgaaaa 6240 

ttcctgtgat tttttttaaa tgtgcagtac acatcagcct cactgagcta ataaagggaa 63 00 
acgaatgttt caaatct 6317 

<210> 232 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_013261 
<400> 232 

ctgtagtcta agacctgatc tatagatacc tagaatagcc atgtactata atgtgatgat 60 

<210> 233 
<211> 3237 
<212> UNA 
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<213> Homo sapiens 
<300> 

<308> NM_013277 
<400> 233 

gcgaagtgaa gggtggccca ggtggggcca ggctgactga atgtatctcc tagctatgga_ 60 

ctaaataata catggg^gga aataaacaag tattcatgag ggtgaaaatg tgacccagca 12 0 

ggaaaattac aactattttc aattgacgtt gaataggatg agtcatggaa tttaagtgat 180 

ttactgaaga ttatactact ggtagataga agagctaaag aaagatggat actatgatgc 240 

tgaatgtgcg gaatctgttt gagcagcttg tgcgccgggt ggagattctc agtgaaggaa. 300 

atgaagtcca atttatccag ttggcgaagg actttgagga tttccgtaaa aagtggcaga 360 

ggactgacca tgagctg-ggg aaatacaagg atcttttgat gaaagcagag actgagcgaa 42 0 

gtgctctgga tgttaagctg aagcatgcac gtaatcaggt ggatgtagag atcaaacgga 480 

gacagagagc tgaggctgac tgcgaaaagc tggaacgaca gat t cage tg attcgagaga. 540 

tgctcatgtg tgacaca.tct ggcagcattc aactaagega ggagcaaaaa tcagctctggj- 600 

cttttctcaa cagaggecaa ccatccagca geaatgetgg gaacaaaaga ctatcaacca 660 

ttgatgaatc tggttccatt ttatcagata tcagctttga caagactgat gaatcactgg- 72 0 

attgggactc ttctttg-gtg aagactttca aactgaagaa gagagaaaag aggegctcta 780 

ctagccgaca gtttgtfcgat ggtccccctg gacctgtaaa gaaaactcgt tccattggct 840 

ctgeagtaga ccaggggaat gaatccatag ttgeaaaaac tacagtgact gttcccaatg- 9 00 

atggcgggcc catcgaagct gtgtccacta ttgagactgt gecatattgg accaggagcc 960 

gaaggaaaac aggtacttta caaccttgga acagtgactc caccctgaac agcaggcagc 102 0 

tggagccaag aactgagaca gacagtgtgg gcacgccaca gagtaatgga gggatgegee 1080 

tgeatgaett tgtttctaag aeggttatta aacctgaatc ctgtgttcca tgtggaaagc 1140 

ggataaaatt tggcaaatta tctctgaagt gtcgagactg tcgtgtggtc tctcatccag- 12 00 

aatgtcggga ccgctgtccc cttccctgca ttcctaccct gataggaaca cctgtcaaga 12 60 

ttggagaggg aatgetg-gea gactttgtgt cccagacttc tccaatgatc ccctccattg- 132 0 

ttgtgcattg tgtaaatgag attgagcaaa gaggtctgac tgagacaggc ctgtatagga 13 80 

tctctggctg tgaccgcaca gtaaaagagc tgaaagagaa attcctcaga gtgaaaactg 1440 

tacccctcct cagcaaagtg gatgatatcc atgctatctg tagecttcta aaagactttc 1500 

ttcgaaacct caaagaacct cttctgacct ttcgccttaa cagagecttt atggaagcag- 15 6 0 

cagaaatcac agatgaagac aacagcatag ctgccatgta ccaagctgtt ggtgaactgc 162 0 

cccaggccaa cagggacaca ttagctttcc tcatgattca ettgeagaga gtggctcaga 1680 

gtccacatac taaaatcjgat gttgecaate tggctaaagt ctttggccct acaatagtgg 1740 

cccatgctgt gcccaatcca gacccagtga caatgttaca ggacatcaag cgtcaaccca 1800 

aggtggttga gcgcctg-ctt tccttgcctc tggagtattg gagtcagttc atgatggtgg 1860 

agcaagagaa cattgacccc ctacatgtca ttgaaaactc aaatgccttt tcaacaccac 192 0 

agacaccaga tattaaaigtg agtttactgg gacctgtgac cactcctgaa catcagcttc 1980 

tcaagactcc ttcatctagt tccctgtcac agagagtccg ttccaccctc accaagaaca 2040 

ctcctagatt tgggagcaaa agcaagtctg ccactaacct aggacgacaa ggcaactttt 2100 

ttgcttctcc aatgctcaag tgaagtcaca tctgcctgtt acttcccagc attgactgac 216 0 

tataagaaag gacacatctg tactctgctc tgcagcctcc tgtactcatt actactttta 222 0 

gcattctcca ggcttttact caagtttaat tgtgcatgag ggttttatta aaactatata 22 80 

tatctcccct tccttctcct caagtcacat aatatcagca ctttgtgctg gtcattgttg 2340 

ggagctttta gatgagei.cat ctttccaggg gtagaagggt tagtatggaa ttggttgtga 2400 

ttctttttgg ggaaggcjggt tattgttcct ttggcttaaa gecaaatget gctcatagaa 2460 

tgatctttct ctagtttcat ttagaactga tttccgtgag acaatgacag aaaccctacc 2520 

tatctgataa gattagcttg tctcagggtg ggaagtggga gggcagggca aagaaaggat 2 5 80 

tagaccagag gatttacjgat gcctccttct aagaaccaga agttctcatt ccccattatg 2640 

aactgagcta taatatcjgag ctttcataaa aatgggatgc attgaggaca gaactagtga 2700 

tgggagtatg cgtagctttg atttggatga ttaggtcttt aatagtgttg agtggcacaa 2760 

ccttgtaaat gtgaaacjtac aactegtatt tatctctgat gtgccgctgg ctgaactttg 2 82 0 

ggttcatttg gggtcaaagc cagtttttct tttaaaattg aattcattct gatgettgge 2 880 

ccccataccc ccaaccttgt ccagtggagc ccaacttcta aaggtcaata tatcatcctt 2940 

tggcatccca actaacaata aagagtaggc tataagggaa gattgtcaat attttgtggt 3000 

aagaaaagct acagtcattt tttctttgea ctttggatgc tgaaattttt cccatggaac 3 060 

atagecacat ctagata_gat gtgagctttt tcttctgtta aaattattct taatgtctgt 312 0 

aaaaacgatt ttcttctgta gaatgtttga ettegtattg acccttatct gtaaaacacc 3180 
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tatttgggat aatatttgga. aaaaaagtaa atagcttttt caaaatgaaa aaaaaaa 3237 

<210> 234 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_013277 
<400> 234 

ctcattcccc attatgaact gagctataat atggagcttt cataaaaatg ggatgcattg 6 0 

<210> 235 
<211> 1122 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_013409 
<400> 235 

gctcctcgcc ccgcgcctgc 
tgcctcctgc tgctgctgct 
tgctggctcc gtcaagcgaa 
aaggaggagt gctgcagcac 
gacaacacac tcttcaagtg 
aaagaaacgt gtgagaacgt 
aacaaacccc gctgcgtctg 
tgcgggctgg atgggaaaac 
gagcagccag aactggaagt 
ttctgtccag gcagctccac 
tgtaatcgga tttgcccaga 
gtcacctact ccagtgcctg 
ggattagcct atgagggaaa 
ggtgggaaaa aatgtttatg 
gagctgtgcc ctgacagtaa 
gccagcgagt gtgccatgaa 
cactccggat cttgcaactc 
caggactaca gctttcctat 
gtgttcacat agcctttgtg 

<210> 236 
<211> 60 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_013409 
<400> 236 

gaagatgaag accaggacta cagctttcct atatcttcta ttctagagtg gtaaactctc 60 

<210> 237 

<211> 11389 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_014246 



ccccaggatg gtccgcgcga 
ctgccagttc atggaggacc 
gaacggccgc tgccaggtcc 
cggccggctg agcacctcgt 
gatgattttc aacgggggcg 
ggactgtgga cctgggaaaa 
cgccccggat tgttccaaca 
ctaccgcaat gaatgtgcac 
ccagtaccaa ggcagatgta 
atgtgtggtg gaccagacca 
gcctgcttcc tctgagcaat 
ccacctgaga aaggctacct 
gtgtatcaaa gcaaagtcct 
ggatttcaag gttgggagag 
gtcggatgag cctgtctgtg 
ggaagctgcc tgctcctcag 
catttcggaa gacaccgagg 
atcttotatt ctagagtggt 
caaaaaaaaa aaaaaaaaaa 



ggcaccagcc gggtgggcfct 60 

gcagtgccca ggctgggaac 12 0 

tgtacaagac cgaactgagc 180 

ggaccgagga ggacgtgaat 240 

cccccaactg catcccctgt 3 00 

aatgccgaat gaacaagaag 3 60 

tcacctggaa gggtccagtc 42 0 

tcctaaaggc aagatgtaaa 480 

aaaagacttg tcgggatgtt 54 0 

ataatgccta ctgtgtgacc 600 

atctctgtgg gaatgatgcja 660 

gcctgctggg cagatctatt 72 0 

gtgaagatat ccagtgcact 7 80 

gccggtgttc cctctgtga_t 840 

ccagtgacaa tgccacttat 900 

gtgtgctact ggaagtaaag 960 

aagaggagga agatgaagac 102 0 

aaactctcta taagtgttca 1080 
aa 1122 
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<400> 237 

atggcgccgc cgccgccgcc cgtgctgccc gtgctgctgc tcctggccgc cgccgccgcc 60 
ctgccggcga tggggctgcg agcggccgcc tgggagccgc gcgtacccgg cgggacccgc 12 0 
gccttcgccc tccggcccgg ctgtacctac gcggtgggcg ccgcttgcac gccccgggcg 180 
ccgcgggagc tgctggaccrt gggccgcgat gggcggctgg caggacgtcg gcgcgtctcg 240 
ggcgcggggc gcccgctgcc gctgcaagtc cgcttggtgg cccgcagtgc cccgacggcg 3 00 
ctgagccgcc gcctgcgg^c gcgcacgcac cttcccggct gcggagcccg tgcccggctc 360 
tgcggaaccg gtgcccggct ctgcggggcg ctctgcttcc ccgtccccgg cggctgcgcg 42 0 
gccgcgcagc attcggcgct cgcagctccg accaccttac ccgcctgccg ctgcccgccg 480 
cgccccaggc cccgctgtcc cggccgtccc atctgcctgc cgccgggcgg ctcggtccgc 540 
ctgcgtctgc tgtgcgccct gcggcgcgcg gctggcgccg tccgggtggg actggc^ctg 600 
gaggccgcca ccgcggggac gccctccgcg tcgccatccc catcgccgcc cctgccgccg 660 
aacttgcccg aagcccgggc ggggccggcg cgacgggccc ggcggggcac gagcggcaga 72 0 
gggagcctga agtttccgat gcccaactac caggtggcgt tgtttgagaa cgaaccggcg 7 80 
ggcaccctca tcctccagct gcacgcgcac tacaccatcg agggcgagga ggagcgcgtg 840 
agctattaca tggaggggct gttcgacgag cgctcccggg gctacttccg aatcgactct 9 00 
gccacgggcg ccgtgagcac ggacagcgta ctggaccgcg agaccaagga gacgcacgtc 960 
ctcagggtga aagccgtgga ctacagtacg ccgccgcgct cggccaccac ctacatcact 102 0 
gtcttggtca aagacaccaa cgaccacagc ccggtcttcg agcagtcgga gtaccgcgag 1080 
cgcgtgcggg agaacctgga ggtgggctac gaggtgctga ccatccgcgc cagcgaccgc 1140 
gactcgccca tcaacgccaa cttgcgttac cgcgtgttgg ggggcgcgtg ggacgtcttc 1200 
cagctcaacg agagctctcjg cgtggtgagc acacgggcgg tgctggaccg ggaggag-gcg 12 6 0 
gccgagtacc agctcctgg-t ggaggccaac gaccaggggc gcaatccggg cccgctcagt 132 0 
gccacggcca ccgtgtacat cgaggtggag gacgagaacg acaactaccc ccagttcagc 13 80 
gagcagaact acgtggtcca ggtgcccgag gacgtggggc tcaacacggc tgtgctg-cga 1440 
gtgcaggcca cggaccggga ccagggccag aacgcggcca ttcactacag catcctcagc 1500 
gggaacgtgg ccggccagtt ctacctgcac tcgctgagcg ggatcctgga tgtgatcaac 1560 
cccttggatt tcgaggatcrt ccagaaatac tcgctgagca ttaaggccca ggatggg-ggc 162 0 
cggcccccgc tcatcaattc ttcaggggtg gtgtctgtgc aggtgctgga tgtcaacgac 1680 
aacgagccta tctttgtgatg cagccccttc caggccacgg tgctggagaa tgtgcccctg 1740 
ggctaccccg tggtgcacsLt tcaggcggtg gacgcggact ctggagagaa cgcccggctg 1800 
cactatcgcc tggtggacac ggcctccacc tttctggggg gcggcagcgc tgggcctaag 1860 
aatcctgccc ccacccctga cttccccttc cagatccaca acagctccgg ttggatcaca 192 0 
gtgtgtgccg agctggaccg cgaggaggtg gagcactaca gcttcggggt ggaggcg^tg 19 80 
gaccacggct cgccccccat gagctcctcc accagcgtgt ccatcacggt gctggacg-tg 2 040 
aatgacaacg acccggtgtt cacgcagccc acctacgagc ttcgtctgaa tgaggatg-cg 2100 
gccgtgggga gcagcgtgct gaccctgcag gcccgcgacc gtgacgccaa cagtgtgatt 2160 
acctaccagc tcacaggccr<g caacacccgg aaccgctttg cactcagcag ccagagaggg 222 0 
ggcggcctca tcaccctggc gctacctctg gactacaagc aggagcagca gtacgtgotg 22 80 
gcggtgacag catccgacgg cacacggtcg cacactgcgc atgtcctaat caacgtcact 23 40 
gatgccaaca cccacaggcc tgtctttcag agctcccatt acacagtgag tgtcagt^ag 240 0 
gacaggcctg tgggcacctc cattgctacc ctcagtgcca acgatgagga cacaggagag 2460 
aatgcccgca tcacctacgt gattcaggac cccgtgccgc agttccgcat tgaccccgac 252 0 
agtggcacca tgtacaccat gatggagctg gactatgaga accaggtcgc ctacacgctg 2580 
accatcatgg cccaggacaa cggcatcccg cagaaatcag acaccaccac cctagagatc 2 640 
ctcatcctcg atgccaatga caatgcaccc cagttcctgt gggatttcta ccagggttcc 27 00 
atctttgagg atgctccacc ctcgaccagc atcctccagg tctctgccac ggaccgggac 2760 
tcaggtccca atgggcgtct gctgtacacc ttccagggtg gggacgacgg cgatggg^ac 2 82 0 
ttctacatcg agcccacgtc cggtgtgatt cgcacccagc gccggctgga ccgggagcaat 2 880 
gtggccgtgt acaacctttg ggctctggct gtggatcggg gcagtcccac tccccttagc 2940 
gcctcggtag aaatccaggt gaccatcttg gacattaatg acaatgcccc catgttt^ag 3 000 
aaggacgaac tggagctgtir tgttgaggag aacaacccag tggggtcggt ggtggcaeiag 3 060 
attcgtgcta acgaccctga tgaaggccct aatgcccaga tcatgtatca gattgtggraa 312 0 
ggggacatgc ggcatttctt ccagctggac ctgctcaacg gggacctgcg tgccatgg-tg 3180 
gagctggact ttgaggtcccr gcgggagtat gtgctggtgg tgcaggccac gtcggctccg 3240 
ctggtgagcc gagccacggt gcacatcctt ctcgtggacc agaatgacaa cccgcctgtg 33 00 
ctgcccgact tccagatcct cttcaacaac tatgtcacca acaagtccaa cagtttcccc 33 60 
accggcgtga tcggctgcat cccggcccat gaccccgacg tgtcagacag cctcaactac 342 0 
accttcgtgc agggcaacga gctgcgcctg ttgctgctgg accccgccac gggcgaactg 3480 
cagctcagcc gcgacctggei caacaaccgg ccgctggagg cgctcatgga ggtgtctg-tg 3 54 0 
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tctgatggca 
gacatgctga 
tccccgctgc 
gacgtcttcg 
accttctcgg 
caggagcaga 
cccttcgacg 
gttctgcgat 
atccacccca 
gagacggaga 
cgcgagggcg 
gatgcccgct 
ctgctcatcg 
gaggtgacca 
cgcttccact 
tacaacggcc 
gtgcagctca 
ggtgtgagtg 
ggccacctgg 
gattgtgaca 
gcccagggca 

gggggtgtcc 

atgcggaacc 
ggcacccggg 
ggaggcacct 
gggaagaact 
tcctggagtg 
acccggaagg 
ctccagatcc 
tccgtgatgc 
ctgaagaatg 
gggatggacc 
gtggtggtcg 
atgcagggag 
gcactcaagg 
tgtcccccca 
gggtaccttg 
gcctgcgtgc 
tacgggccgt 
cccgtctgtg 
accaacggcc 
ctgccctgcg 
cagtgtgcct 
gccgaggtca 
gccggcatct 
ggatccgttg 
ctctttaact 
cgcaatgaga 
gctacacagc 
ggccacgtcc 
gccgactttc 
gcggcgtggg 
gagggctact 
atcgtcaccg 
gccagggtcc 
gtctccttcc 
ccggctggcc 
gccccgatca 
gtcatcattt 
agcctccggt 
agcgaggggg 



tccacagcgt 

ccaacagcat 

tggccctctt 

tcttcaacgrt 

cgctgctgcc 

tctacctgaa 

acaacatctg 

tcgacagctc 

tcaacggcc t 

tcgacctctg 

gctacacctg 

caggccgctg 

gcggcttcca 

ccaggagct t 

tcaccatctc 

gcttcaatg-a. 

ccttctctcjc 

acgggcggtg- 

gcctgcccca 

caaccatggc 

ctcagaccgg 

ccaacctgcc 

tgtcagtcg-oi 

aaggctgcgrc 

gtgtcaacag 

gtgagcaag-c 

acctgaacat 

aggacagcg-t 

tgaacaacta 

tgtccgggt t 

ttaaggagg-ei 

agaacaaggc 

gaggcgcctc 

tgaggatggcj 

tcagggtgasi 

atagccgct^ 

gaataaactg 

gctcccccgg 

actgtgagaa 

gaccctgcca 

agtgccaatg 

actgcttccc 

gcaagcccgg 

ccacgctcgg 

ggtggccaca 

gaaatgcgg-fc 

gtaccaccat 

cgcaggtggei 

acacgggcac 

ttcagcacga 

acgaggacgt 

agcagatcca 

tcagcaacgt 

ccaacatgat 

cgcgattcgei 

cagccgactt 

ggaggaccac 

gcaggcggag- 

accgcaccct 

tgcctcaccg 

ctccgctcco 



cacggccttc 

cactgtccgc 

cgtggagggg 

ccagaacgac 

tggcggcgtc 

tcggacgctg 

cctgcgcgag 

cgcgcccttc 

gcgctgccgc 

ctactccgac 

cgagtgcttc 

tgccaacggg 

ctgcgtgtgt 

cccgccccag 

cctcacgttt 

gaagcacgac 

aggcgagaca 

gcactctgtg 

tgggccgtcc 

tgtgcgcttt 

ctccaagaag 

agaagacttc 

cggcaaaaat 

tgctcggagg 

gtggaatatg 

catgcctcac 

catcatctct 

tctgatggag 

cctccagttt 

gcgggtgacc 

cagtgagatg 

agatatcggg 

tgaagacaag 

ggggacgccc 

ggacggctgt 

ccacgacgcc 

tgtggatgcc 

ctccccgcag 

caaactcgac 

ctgtgccgtc 

caaggagaat 

ccatggctcc 

cgtcatcggc 

ctgtgaagtg 

gaccaagttc 

ccgacactgc 

ctccttcgtg 

cggcgccagg 

gctctttggc 

gagctggcag 

catccactcg 

gcggagcgag 

ggcacgcaac 

tcttgctgtc 

caccatccat 

cttcagacca 

cccgcagacc 

gcgacaccct 

ggggcagctc 

gcccatcatt 

gagacccctg 



tgcaccctgc 

ctggagaaca 

gtggccgccg 

accgacgtca 

cgcggccagt 

ctgaccacca 

ccctgcgaga 

ctcagctcca 

tgcccgcccg 

ccgtgcggcg 

gaggacttca 

gtgtgcaaga 

cctcctggcg 

tccttcgtca 

gccactcagg 

ttcatcgccc 

acaacgaccg 

caggtgcagt 

ggggaaaaga 

ggaaaggaca 

tccctggatc 

ccagtgcaca 

gtggacatgg 

aacttctgcg 

tatctgtgtg 

ccccagctct 

gtgccctggt 

gccaccagtg 

gaggtgtccc 

gacggggagt 

aagcacctgg 

ggcatgcttc 

gtctccgtgc 

accaacgtcg 

gatgtggacg 

tgggaggact 

tgtcacctga 

ggctacgtgt 

cttccgtgcc 

agcaaaggct 

tactacaagc 

cacagccgca 

cgccagtgca 

atctacaatg 

gggcagccgg 

agcggggaga 

gacctcaggg 

gccctgcagc 

aatgacgtgc 

cagggcttcg 

ggcagcgccc 

ggcggcacgg 

gtgcggcgga 

gacatctttg 

gaagagttcc 

cctgaagaaa 

acgcgcccgg 

gatgacgctg 

ctgcccgagc 

aataccccga 

gagaggcccg 



gtgtcaccat 

tgtcccagga 

tgctgtccac 

gctccaacat 

tcttcccgtc 

tctccacgca 

actacatgaa 

ccaccgtgct 

gcttcaccgg 

ccaacggccg 

ctggagagca 

acgggggcac 

agtatgagag 

ccttccgggg 

aaaggaacgg 

tggagatcgt 

tggcaccgaa 

actacaacaa 

tggccgtggt 

tcgggaacta 

tgaccggccc 

accggcagtt 

ccggattcat 

atgggaggcg 

agtgtccact 

tcagcggtga 

acctggggct 

gtgggcccac 

acggcccctc 

ggcaccacct 

tcaccatgac 

ccgggctgac 

gccgtggatt 

ccaccctgaa 

acccctgtac 

acagctgcgt 

acccctgcga 

gcgagtgtgg 

ccagaggctg 

ttgatcccga 

tcctagccca 

cttgcgacat 

accgctgcga 

gctgtcccaa 

ctgcggtgcc 

agggctggct 

ccatgaatga 

tggtgagggc 

gcacggccta 

acctggcagc 

tcctggcccc 

cacagctgct 

cgtacctgcg 

acaagttcaa 

ccagggagct 

aagaaggccc 

ggcctggcac 

gccagttcgc 

gctacgaccc 

tggtgagcac 

tcctggtgga 



catcacggac 3 600 

gaagttcctg 3 660 

caccaaggac 372 0 

cctgaacgtg 3780 

ggaggacctg 3 840 

gcgcg-tgctg 3 9 00 

gtgcg-tgtcc 3960 

cttccggccc 402 0 

cgactactgc 4080 

ctgccgcagc 4140 

ctgtgeiggtg 42 00 

ctgcgtgaac 42 60 

gccctactgt 4320 

cctgagacag 4380 

cttgc-ttctc 4440 

ggacgagcag 45 00 

ggttcccagt 4560 

gcccaatatt 462 0 

gacagtggat 4680 

cagctgcgct 4740 

tctactcctg 4800 

cgtggg-ctgc 4860 

cgccaacaat 492 0 

gtgtcagaat 4980 

ccgattcggc 5040 

gagcgtcgtg 5100 

catgttccgg 5160 

cagctttcgc 5220 

cgatgtggag 52 80 

get gate gag 5340 

cttggactat 5400 

ggtaaggagc 546 0 

ccgaggctgc 5 52 0 

catgaacaac 5580 

ctcgagcccc 5640 

ctgtgacaaa 5700 

gaacatgggg 57 60 

gcccagtcac 5 82 0 

gtgggg-gaac 5 8 80 

ctgtaataag 5940 

ggacacctgt 6000 

ggccaccggg 6060 

caacccgttt 612 0 

agcatttgag 6180 

atgccctaag 6240 

gcccccagag 63 00 

gaagctgagc 63 60 

gctgccjcagt 642 0 

ccagcfcgctg 6480 

cacgcaggac 6540 

agccaccagg 6600 

ccggccjcctc 6660 

gcccttcgtc 6720 

etttaeggga 67 80 

ggagtcctcc 6840 

ectgetgagg 6900 

cgagag-ggag 69 60 

cgtcgctctg 702 0 

cgacccjtcgc 7 080 

gctggtgtac 7140 

gttcgccctg 7200 
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ctggaggtgg aggagcgaac caagcctgtc tgcgtgttct ggaaccactc cctggccgtt 72 60 

ggtgggacgg gagggtgcrtc tgcccggggc tgcgagctcc tgtccaggaa ccggacacat 732 0 

gtcgcctgcc agtgcagcca cacagccagc tttgcggtgc tcatggatat ctccag-gcgt 73 80 

gagaacgggg aggtcctgcc tctgaagatt gtcacctatg ccgctgtgtc cttgtcactg 7440 

gcagccctgc tggtggcott cgtcctcctg agcctggtcc gcatgctgcg ctccaacctg 7500 

cacagcattc acaagcacct cgccgtggcg ctcttcctct ctcagctggt gttcgtgatt 7560 

gggatcaacc agacggaaaa cccgtttctg tgcacagtgg ttgccatcct cctccactac 762 0 

atctacatga gcaccttfcgc ctggaccctc gtggagagcc tgcatgtcta ccgcatgctg 7 680 

accgaggtgc gcaacatcga cacggggccc atgcggttct actacgtcgt gggctg-gggc 77 40 

atcccggcca ttgtcacs.gg actggcggtc ggcctggacc cccagggcta cgggaacccc 7800 

gacttctgct ggctgtccrct tcaagacacc ctgatttgga gctttgcggg gcccatcgga 7860 

gctgttataa tcatcaacac agtcacttct gtcctatctg caaaggtttc ctgccaaaga 792 0 

aagcaccatt attatggg-aa aaaagggatc gtctccctgc tgaggaccgc attcct cctg 7980 

ctgctgctca tcagcgccac ctggctgctg gggctgctgg ctgtgaaccg cgatgcactg 8040 

agctttcact acctcttcgc catcttcagc ggcttacagg gccccttcgt cctccttttc 8100 

cactgcgtgc tcaaccagga ggtccggaag cacctgaagg gcgtgctcgg cgggag^-aag 8160 

ctgcacctgg aggactccgc caccaccagg gccaccctgc tgacgcgctc cctcaactgc 82 2 0 

aacaccacct tcggtgacgg gcctgacatg ctgcgcacag acttgggcga gtccaccgcc 82 80 

tcgctggaca gcatcgtcag ggatgaaggg atccagaagc tcggcgtgtc ctctgg^ctg 83 40 

gtgaggggca gccacggaga gccagacgcg tccctcatgc ccaggagctg caaggatccc 8400 

cctggccacg attccgactc agatagcgag ctgtccctgg atgagcagag cagctcttac 84 60 

gcctcctcac actcgtcaga cagcgaggac gatggggtgg gagctgagga aaaatgggac 852 0 

ccggccaggg gcgccgtcca cagcaccccc aaaggggacg ctgtggccaa ccacgttccg 85 80 

gccggctggc ccgaccagag cctggctgag agtgacagtg aggaccccag cggcaagccc 8640 

cgcctgaagg tggagaccaa ggtcagcgtg gagctgcacc gcgaggagca gggcagtcac 87 00 

cgtggagagt accccccgga ccaggagagc gggggcgcag ccaggcttgc tagcagccag 87 60 

cccccagagc agaggaaa_g-g catcttgaaa aataaagtca cctacccgcc gccgctgacg 882 0 

ctgacggagc agacgctgaa gggccggctc cgggagaagc tggccgactg tgagcag-agc 8880 

cccacatcct cgcgcacg-tc ttccctgggc tctggcggcc ccgactgcgc catcacagtc 8940 

aagagccctg ggagggag-cc ggggcgtgac cacctcaacg gggtggccat gaatgt^cgc 9 000 

actgggagcg cccaggccga tggctccgac tctgagaaac cgtgaggcaa gcccgtcacc 9 060 

ccacacaggc tgcggcatca ccctcagacc ttggagccca aggggccact gcccttgaag 912 0 

tggagtgggc ccagagtg-fcg gcggtcccca tggtggcagc cccccgactg atcatccaga 9180 

cacaaaggtc ttggttctcc caggagctca gggcctgtca gacctggtga caagtgccaa 92 40 

aggccacagg catgagggag gcgtggacca ctgggccagc accgctgagt cctaagactg 93 00 

cagtcaaagc cagaactgag aggggacccc agactgggcc cagaggctgg ccagagttca 93 6 0 

ggaacgccgg gcacagacca aagaccgcgg tccagccccg cccaggcggg catctcatgg 942 0 

cagtgcggac ccgtggctgg cagcccgggc agtcctttgc aaaggcaccc cttgtcfctaa 9480 

aatcacttcg ctatgtggg-a aaggtggaga tacttttata tatttgtatg ggactctrgag 9540 

gaggtgcaac ctgtatatat attgcattcg tgctgacttt gttatcccga gagatccatg 9600 

caatgatctc ttgctgtctt ctctgtcaag attgcacagt tgtacttgaa tctggcatgt 9660 

gttgacgaaa ctggtgcccc agcagatcaa aggtgggaaa tacgtcagca gtggggctaa 972 0 

aaccaagcgg ctagaagccc tacagctgcc ttcggccagg aagtgaggat ggtgtgg-gcc 97 80 

ctccccgccg gccccctggg tccccagtgt tcgctgtgtg tgcgtttgtc ctctgctgcc 9840 

atctgccccg gctgtgtgeia ttcaagacag ggcagtgcag cactaggcag gtgtgag-gag 9900 

ccctgctgag gtcactgtcrg ggcacggttg ccacacggct gtcatttttc acctggtcat 99 60 

tctgtgacca ccaccccctc ccctcaccgc ctcccaggtg gcccgggagc tgcaggfcggg 10020 

gatggctttg tcctttgctc ctgctccccg tgggacctgg gaccttaaag cgttgcetggt 10080 

tcctgatttg gacagaggtg tggggccttc caggccgtta catacctcct gccaattctc 10140 

taactctctg agactgcgag gatctccagg cagggttctc ccctctggag tctgaccaat 102 00 

tacttcattt tgcttcaasit ggccaattgt gcagagggac aaagccacag ccacactctt 10260 

caacggttac caaactgttt ttggaaattc acaccaaggt cgggcccact gcaggca_gct 1032 0 

ggcacagcgt ggcccgagg-g gctgtggaac gggtcccgga actgtcagac atgtttg-att 10380 

ttagcgtttc ctttgttctt caaatcaggt gcccaaataa gtgatcagca cagctgcttc 10440 

caaataggag aaaccataaa ataggatgaa aatcaagtaa aatgcaaaga tgtccacact 10500 

gttttaaact tgaccctgat gaaaatgtga gcactgttag cagatgccta tgggaga_gga 10560 

aaagcgtatc tgaaaatgg-t ccaggacagg aggatgaaat gagatcccag agtcctcaca 1062 0 

cctgaatgaa ttatacatg-t gccttaccag gtgagtggtc tttcgaagat aaaaaactct 10680 

agtcccttta aacgtttgcc cctggcgttt cctaagtacg aaaaggtttt taagtcttcg 10740 

aacagtctcc tttcatgact ttaacaggat tctgccccct gaggtgtaat ttttttg-ttc 10800 
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tatttttttc 
actgttacca 
attacttaag 
attaatgttc 
gaatttgtac 
cagggtggcc 
tccctcagcc 
cctagatggg 
tcttcagttt 
agacgttaga 



cacgtactcc 
aaaaacaact 
actttatg^g 
aaaacactgg 
ttctgcggfcg 
gtggaaggg-a 
agtggaacgt 
cagtttttg-t 
tcttttcttg 
cctcttgafcg 



acagccaaca 
gtcagtttta 
agagattaga 
aattacaaat 
ctggtttttc 
cggtttacgg 
gccggaactt 
ctttcaccaa 
cactgatctt 
ttttcccact 



tcacgaggtg 
ttgagatggg 
cactggaggt 
gagaagagtc 
tccacaaaca 
acgtgcagct 
tttgtccatt 
atttgaggac 
tctcctctcc 
ggtccctgag 



taatttttaa 
aaaaatgtaa 
ttttaacaga 
tacaataaat 
cccccgcccc 
gagctgtccg 
ccctagtagg 
tttttttttt 
ttctgtgact 
gctctgttc 



tttgatcaga 
acctattttt 
acgtgtattt 
taagafctttt 
tccccatgcc 
tgtcccatgc 
cctgccacag 
tgccattatt 
ccagtgactc 
11389 



<210> 238 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_014246 
<400> 238 

gggagagatt agacactgga ggtttttaac agaacgtgta tttattaatg ttcaaaacac 

<210> 239 

<211> 4372 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_014314 



10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 



60 



<400> 239 

tagttattaa 

cctgctttcc 

cctgcgggga 

gcctgcaagc 

acatggcccc 

agggcccaat 

gctggttccg 

ccattgaaag 

aacgtttaca 

ctgaatgttt 

tgatggcagg 

ccaaaacttt 

tagagaaagg 

acatacagat 

caccttcaga 

tagagcttgc 

gtggaaaaac 

gacaaaaggg 

ctgtattctc 

cagctgagaa 

cacagattct 

tgatgatatt 

ttaattatct 

tgactgcctc 

gcaagctgtg 

aactggagca 

gcgacaaatt 

gaatctgcaa 

aatatgaaca 

aagatgaaga 



agttcctatg 
ccgctcgcca 
acgtagctag 
cttccaggat 
ctggtttacjg 
ggaggctgcc 
tggctttttg 
ttgggatttc 
accagaattt 
aattaatcag 
tgcagagaaa 
gaaacttgct 
tataaaagat 
tttctaccaa 
agtgtctga-t 
tttgcctgct 
ctttgtttca. 
gaaagttgtc 
aaaatacttt 
tgtcccagtg 
tgtgaacaac 
tgatgaatg~c 
agatcagaaa 
ggttggtgtt 
tgcttctct t 
agttgttta,t 
taaatacatc 
agacctcgaa 
atggattgt t 
gagcaggat t 



cagctccgcc 
cgccctcctc 
ctgcaagcag 
tatatccgga 
gaggaagagg 
acactttttc 
gatgccctag 
aaaaaaattg 
aaaaccagaa 
gaatgtgaag 
ttggtggaat 
ttggagaaag 
gttgaaacag 
gaagatccag 
acaaacttgt 
atgaaaggaa 
ctgcttatat 
ttttttgcga 
gaaagacatg 
gaacagattg 
cttaaaaagg 
cacaacacta 
cttggaggat 
ggggatgcca 
gatgcgtcag 
aagccccaga 
atagctcagc 
aacttatctc 
acagttcaga 
tgtaaagccc 



tcgcgtccgg 
ctacccggct 
aggccggcat 
agaccctgga 
tgcagtatat 
tcaagttcct 
accatgcagg 
aaaagttgga 
ttatcccaac 
aaattctaca 
gccttctcag 
aaaggaacaa 
aagatcttga 
aatgccagaa 
acagcccatt 
aaaacacaat 
gtgaacatca 
atcagatccc 
ggtatagagt 
ttgagaacaa 
gaacgattcc 
gtaaacaaca 
cttcaggccc 
aaaacacaga 
tgatagcaac 
agtttttcag 
tgatgaggga 
aaattcaaaa 
aagcatgcat 
tgtttttata 



cctcatttcc 
ttaaagctag 
gaccaccgag 
ccctacctac 
tcaggctgag 
gttggagctc 
ttattctgga 
ggagtataga 
cgatatcatt 
gatttgctct 
atcagacaag 
gttcagtgaa 
ggataagatg 
tcttagtgag 
taaaccaaga 
aatatgtgct 
tcttaaaaaa 
agtgtatgaa 
tacaggcatt 
tgacatcatc 
atcactatcc 
cccgtacaat 
actgccccag 
tgaagccttg 
agtcaaacac 
gaaagtggaa 
cacagagagt 
tagggaattt 
ggtgttccag 
cacttcacat 



tcggaaaatc 60 

tgaggcacag 12 0 

cagcgacgca 180 

atcctgagct 240 

aaaaacaaca 3 00 

caggaggaag 3 60 

ctttatgaag 42 0 

ttactt ttaa 480 

tctgatotgt 540 

actaagggga 600 

gaaaac tggc 660 

ctgtgg«attg 72 0 

gaaact-bctg 7 80 

aattcatgtc 840 

aattaccaat 900 

cctacaggtt 960 

ttcccacaag 1020 

cagcagaaat 1080 

tctggagcaa 1140 

attttaactc 12 00 

atctttsictt 1260 

atgatceitgt 132 0 

gtcattg-ggc 13 80 

gattatsitct 1440 

aatctgg-agg 15 00 

tcacggatta 15 60 

ctggcaaaga 162 0 

ggaacaoaga 1680 

atgccagaca 1740 

ttgcggsiaat 1800 
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ataatgatgc 
aagacttctt 
agagatttga 
atcctaaact 
caataacaat 
aaggaaatcc 
atcagaacac 
gtggagatca 
agtgcaatct 
gaggcagagg 
ttgaaaaaga 
ttcagacatg 
aattcatcag 
tgctctgcag 
aatgccatta 
ccaagccaaa 
actgcagcca 
aaattgaaag 
ggaaggactt 
aggtcctcaa 
gtgggtataa 
agcagtttgt 
tttgtaggcc 
tctttttaga 
tgaatgaatg 
ggcttttgcc 
tgccaaatcc 
gagcactctt 
ttttattgag 
tactgtacat 
atatgtactc 
cataaagcca 
caatgtgtat 
cttcttgaaa 
ccattgttct 
gtctgttctc 
aaagattaga 
ttcaaagagc 
atcaacggga 
caggcgggga 
gtaaaacaag 
tgtgaaaagt 
taagacccaa 



cctcattatc 
cagcaatgtc 
agaaaagctg 
tgaagacctc 
tctctttgtg 
taaactcagt 
aggaatgacc 
caatattctg 
tgtcatcctt 
aagagcaaga 
acaaataaac 
ggacgaagca 
agatagtcaa 
aaagtgcaaa 
cactgtgctt 
gcagttttca 
tgactgggga 
ttttgtggtg 
tcattttgag 
tcttcagcta 
tcatggatcg 
actatattat 
aacagagctc 
gtcaactgcc 
gaatgttggg 
cttaacaatc 
actgggcact 
cttgaagctc 
tcctcctctt 
atgtatatgt 
atgttcctgg 
tttttatgct 
ccactctaca 
acttagttta 
ctttgaccat 
agatcccttg 
atatgtatcc 
ttagtttgcg 
tgaggtgtta 
gccagagaaa 
gataatactg 
acatgacaca 
tacaaatgga 



a.gtgagcatg 
ogagcagcag 
caggaactag 
tgcttcatct 
aaaaccagag 
tttctaaaac 
ctcccggcac 
attgccacct 
tatgagtatg 
^gtagcaagt 
atgtacaaag 
g-tatttaggg 
gaaaaaccaa 
gccttggcat 
cjgagatgctt 
cigttttgaaa 
oitccatgtga 
cjaggatattg 
aagataccat 
cagggaatga 
cttgtacccc 
atatgcaaag 
oitagtacttg 
aacaaacaca 
^.atttttatc 
tactgaaatt 
tcttaacctt 
tccctgggct 
tgctgatccc 
aactaatata 
tgtagcaacg 
tactactaaa 
g-tggtcagct 
ctatacatct 
tttcctataa 
gatgttacag 
aaatggcatt 
Q-tttcctgga 
cagctgcctc 
cttgccagtt 
aactgtaagg 
Q-tagttgctt 
ttatttaaac 



cacgaatgaa 
gattcgatga 
aaagtgtttc 
tacaagaaga 
cacttgtgga 
ctggcatatt 
agaagtgtat 
cagttgctga 
tgggcaatgt 
gcttccttct 
aaaaaatgat 
aaaagattct 
aacctgtacc 
gttacacagc 
ttaaggaatg 
aaagagcaaa 
agtacaagac 
caactggagt 
ttgatccagc 
gtaactttga 
tgtgaaaata 
cacaaatgag 
ggaaaaatta 
cagtaatcac 
tccctttgtc 
gttcttttga 
ctatttgacc 
tctctctctt 
ttccaagggt 
catacataca 
tgtggtatgg 
agctgtccac 
tttagtagag 
tgccctatta 
tgatgttgat 
atgaggcagt 
cacgtgtcac 
cgtggaaaca 
cctcttcatg 
atataacttc 
gttagtggag 
gataatagtt 
caaaaaaaaa 



agatgctctg 
gattgagcaa 
cagggatccc 
gtaccactta 
cgctttaaaa 
gactggacgt 
attggatgca 
tgaaggcatt 
catcaaaatg 
gactagtaat 
gaatgactct 
gcatatacag 
tgataaggaa 
tgacgtaaga 
ctttgtgagt 
gatattctgt 
atttgagatt 
tcagacactg 
agaaatgtcc 
gtggagaaga 
tattttttaa 
tgaatcacag 
aaaagcctca 
tctgtacaca 
tccttaacct 
aggttaccag 
tctgcgcatt 
ctagttctat 
tcaatatata 
ggtatgtata 
ctacacagag 
tgtagagttg 
agcataaaaa 
atatgttctc 
gttcaacacc 
ctgactgtcc 
ttagcaaggt 
agtatctgag 
caatctggtg 
tctttggctt 
agtttttaat 
actagtagta 
aaaaaaaaaa 



gattacttga 1860 

gatcttactc 192 0 

agcaatgatga 19 80 

aacccagaga 2 040 

aattggattg 2100 

ggcaaaaoaa 216 0 

ttcaaagcca 222 0 

gacattgcac 22 8 0 

atccaaacca 23 40 

gctggtgtaa 2400 

attttacg-cc 2460 

actcatgaaa 2 52 0 

aataaaaaac 2 580 

gtgatagstgg 2 640 

agaccacatc 27 00 

gcccgacaga 27 60 

ccagttataa 2 82 0 

tactcgaagt 2 880 

aaatgatatc 2 940 

aacaaacata 3 000 

aaatatcttt 3060 

cactgagtat 312 0 

tttctagcct 3180 

ctgggatstga 3240 

actgtaaact 33 00 

tgactctcjgt 33 60 

tggccctg-tt 3 42 0 

tctagtcttt 3480 

tacatgtata 3 540 

tgtaatgg-tt 3 60 0 

aacatgagaa 3 66 0 

ctgtatgtag 3720 

tgataaaata 37 80 

ttaacgtgtg 3 840 

tggactgeiat 39 00 

tttctacttg 3960 

ttgctgatgc 402 0 

ttccctgg-ag 4080 

agcagtgcgtg 4140 

ttcttcatct 4200 

taaaagaatg 42 6 0 

gtattcttac 432 0 
aa 4372 



<210> 240 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM 014314 



<400> 240 

agttcagaca ctgtactcga a.gtggaagga ctttcatttt gagaagatac catttgatcc 



60 



<210> 241 

<211> 1647 

<212> DNA 

<213> Homo sapiens 



<300> 
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<308> NM_014321 
<400> 241 

gcgcgcgggt ttcgttgacc cgcggcgttc acgggaattg ttcgctttag tgccggcgcc 60 

atggggtcgg agctgatcgg gcgcctagcc ccgcgcctgg gcctcgccga gcccgacatg 12 0 

ctgaggaaag cagaggagta cttgcgcctg tcccgggtga agtgtgtcgg cctctccgca 180 

cgcaccacgg agaccagcag tgcagtcatg tgcctggacc ttgcagcttc ctggatgeiag 2 40 

tgccccttgg acagggctta tttaattaaa ctttctggtt tgaacaagga gacatatcag 3 00 

agctgtctta aatcttttga gtgtttactg ggcctgaatt caaatattgg aataagagac 3 60 

ctagctgtac agttta^ctg tatagaagca gtgaacatgg cttcaaagat actaaaaagc 42 0 

tatgagtcca gtcttcccca gacacagcaa gtggatcttg acttatccag gccacttttc 480 

acttctgctg cactgctttc agcatgcaag attctaaagc tgaaagtgga taaaaacaaa 540 

atggtagcca catccg^tgt aaaaaaagct atatttgatc gactgtgtaa acaactagag 600 

aagattggac agcaggtcga cagagaacct ggagatgtag ctactccacc acggaagaga 660 

aagaagatag tggttg<aagc cccagcaaag gaaatggaga aggtagagga gatgccacat 72 0 

aaaccacaga aagatgaaga tctgacacag gattatgaag aatggaaaag aaaaattttg 7 80 

gaaaatgctg ccagtgctca aaaggctaca gcagagtgat ttcagcttcc aaactggtat 840 

acattccaaa ctgatagtac attgccatct ccaggaagac ttgacggctt tgggattttg 9 00 

tttaaacttt tataateiagg atcctaagac tgttgccttt aaatagcaaa gcagcctacc 9 60 

tggaggctaa gtctgg^cag tgggctggcc cctggtgtga gcattagacc agccacagtg 102 0 

cctgattggt atagccttat gtgctttcct acaaaatgga attggaggcc gggcgca^tg 1080 

gctcacgcct gtaatcccag cactttggga ggccaaggtg ggtggatcac ctgaggtcag 1140 

gagctcgaga ccagcctggc caacatggtg aaaccccatc tctactaaaa atacaaaaat 12 0 0 

tagccaggtg tgatggtgca tgcctgtaat cccagctcct cagtaggctg agacaggagc 12 60 

atcacttgaa cgtggg<aggc agaggttgca gtgagccgag attgcaccac cgcactccag 132 0 

cctgggtgac agagcgagac ttatctcata aataaataga tagatactcc agcctggg-tg 13 80 

acagagcgag acttatagat agatagatag atagatggat agatagatag atagatagat 1440 

agatagataa acggaattgg agccattttg ctttaagtga atggcagtcc cttgtcttat 15 00 

tcagaatata aaattcagtc tgaatggcat cttacagatt ttacttcaat ttttgtgtac 15 60 

ggtatttttt atttgactaa atcaatatat tgtacagcct aagttaataa atgttattta 162 0 
tatatgcaaa aaaaaaaaaa aaaaaaa 1647 

<210> 242 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_014321 



<400> 242 

tgctttaagt gaatggcagt cccttgtctt attcagaata taaaattcag tctgaatggc 



60 



<210> 243 
<211> 1455 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_014364 



<400> 243 

ggcggtccgc 

gaggccatgt 

cagccgtgcc 

cagccacaac 

cctcctcacc 

ggcatcaatg 

gttaaggtgg 

aagtatgact 

gtcgtggaca 



acgcacctcg 
cgaagcgcga 
cggtgaccag 
cagagcccac 
ccgctactcc 
gatttgcjacg 
tggctgtgaa 
ccacccacgg 
accatgagat 



gtaacatcac 
catcgtcctc 
agcaccgccc 
accagtcagg 
tcctcctaag 
catcggtcgc 
tgatccattc 
ccgatacaag 
ctctgtctac 



agcaggtcca 
accaatgtca 
ccacctgagc 
gaggaaataa 
atggtgtctg 
ctggtcctgc 
attgacccgg 
ggaagtgtgg 
cagtgcaaag 



ggccaatgat 
ccgttgtcca 
ctaaggctga 
agccaccacc 
tggcccggga 
gcgcctgcat 
aatacatggt 
aattcaggaa 
agcccaaaca 



aaccttataa 
gttgctgoga 
agtagagccc 
gccaccactg 
gctgactgtg 
ggagaagg-gt 
gtacatgttt 
tggacaactg 
gatcccctgg 



60 

12 0 

180 

240 

300 

360 

420 

480 

540 
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agggctgtcg 
gcttcggacc 
gcaccaatgt 
gtgagcaacg 
cgatttggga 
acagtggacg 
atcccagcct 
aagctgacag 
tgccgcctcg 
aaggggccca 
ctcggtgata 
ttcgtgaagc 
ctcctccgct 
aggggccggg 
gacacccggg 
aaaaaaaaaa 



ggagccccta 
acatctc tgc 
tcgtcatcjgg 
cgtcctgcac 
tcgtgga-aigg 
ggccatcaag 
ccactggggc 
ggatggc^tt 
cccagcctgc 
tggctgg-cat 
cccactcgtc 
tcatttcatg 
acatgttcag 
gccggaacat 
gcgggcg-ccc 
aaaaa 145 5 



cgtggtggag 
aggtgctcaa 
tgtcaatgaa 
caccaactgt 
gttgatgacc 
gaaggcctgg 
tgcgaaagct 
ccgggtacca 
cccctactca 
ccttgcctac 
catcttcgat 
gtacgacaac 
ccgagacaag 
gtgcctcccg 
cacgccgatg 



tccacaggcg 
cgtgtggtca 
aatgactata 
ttggctcccc 
acagtccatt 
cgagatgggc 
gtgaccaaag 
accccggatg 
gccatcaagg 
accgaggatg 
gctaaggccg 
gaatatggct 
tgaaacggga 
ttccagcatc 
ggtccatggt 



tgtacctctc 
tctccgcgcc 
accctggctc 
tcgccaaagt 
cctacacggc 

ggggtgccca 

tcatcccaga 
tgtctgtcgt 
aggctgtaaa 
aggtcgtctc 
gcattgcgct 
acagtcaccg 
aggtcctttc 
tggctgcccg 
gaaataaaaa 



catacaggca 
ctcaccggat 
catgaacatt 
catccacgag 
cacccagaag 
ccagaacatc 
gctcaaaggg 
ggacctgacc 
agcagcagcc 
tacg-g<a.cttc 
caatgacaat 
ggtg-gtcgac 
tttccttccc 
ggggag-gaag 
acagtgctcg 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 



<210> 244 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_014364 



<400> 244 

cgctcaatga caatttcgtg aagctcattt catggtacga caacgaatat ggctacagtc 60 

<210> 245 
<211> 935 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> NM_014462 



<400> 245 

gaagtgggta agggtaatat ggaggagctt 
cagaagtgct ggtctcggtc gggattccgg 
aggagggaag aggtttt^ga cgcgctggcc 
agttcaaaat gaactateitg cctggcaccg 
acttggttct gcttcgacjat ggaaggacac 
ttgcaaactt agtgctacat cagactgtgg 
atattcctcg agggattttt gtggtcagag 
acttggaaaa ggagagtg-ac acacccctcc 
aacaaagggt ggaacagcag accaagctgg 
aggaccgagg tctttccatt cctcgagcag 
agaggctgtt ggctcttg-aa gagtaggggc 
cctcacgcat ttgatcacag actgtagagt 
ttacatatgc aacatgaaga aatcgtgtag 
gtttaaagaa acagtggcat agactccttc 
tttatattgt tcttcatettc ccaaattaga 
aataaaatat gaatctcaaa aaaaaaaaaa 



ccggcaggcc ccggcggctg aaagccgggg 60 

gcttggtccc accgaggcgg cgactgcggt 12 0 

tcccgccgct gtgcattgca gcatteitttc 180 

ccagcctcat cgaggacatt gacaaaaagc 240 

ttataggctt tttaagaagc attgatcaat 3 00 

agcgtattca tgtgggcaaa aaatacggtg 3 60 

gagaaaatgt ggtcctacta ggagaaatag 42 0 

agcaagtatc cattgaagaa attctagaag 480 

aagcagagaa gttgaaagtg caggccctga 540 

atactcttga tgagtactaa tcttttgccc 600 

tgtcactgag tgaaagtgac atcctggcca 660 

tttgaaaagt cacttttatt tttaattatt 72 0 

gtgggttttt tttttaataa caaaatcact 7 80 

acacatcact gtggcaccag caactacttc 840 

gtttacaggg acagtcttca tttacfctgta 9 00 
aaaaa 93 5 



<210> 246 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_014462 
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<400> 246 

ttaataacaa aatcactgtt taaagaaaca gtggcataga ctccttcaca catcactgtg 60 

<210> 247 
<211> 890 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM„0145Ol 



<400> 247 

ggcggaccga 

tgaactccaa 

cgacactgac 

ccgacctcca 

gcatgaaact 

ccaagatctt 

gggactggac 

tccaccctaa 

acgaggagta 

ccagcggcag 

accctggggc 

gcgagcgcga 

cgctgcggcg 

cccctccctc 

agggggcact 



agaacgcagg 
cgtggagaac 
cgcagaccca 
ggtcaccatc 
cctgctgggg 
ccacccgaac 
ggctgagctg 
ccccgagtct 
tgcggctcgg 
ggccgaagcc 
cccagggggc 
taagaagctg 
gctgtagtgg 
caactctgtc 
ggg"acctgga 



aagggggccg 
ctacccccgc 
cccgatggca 
gagggccctg 
aaggacttcc 
gtgggcgcca 
ggcatccgac 
gcactcaacg 
gcccgtctgc 
ggtcgggccc 
ccgggagggg 
gcggccaaga 
gctctcttcc 
tctaagttat 
tttgtttttc 



gggggacccg 

acatcatccg 
tcaaggtctt 
aggggacccc 
ctgcctcccc 
atggcgagat 
acgtactgct 
aggaggcggg 
tcacagagat 
tggccagtgg 
ctgagggtcc 
aaaagacgga 
tccttccacc 
ttaaattatg 
taaataaagt 



cccccggccg 
cctggtgtac 
tcccaacgag 
atatgctgga 
acccaagggc 
ctgcgtcaac 
gaccatcaag 
ccgcctgctc 
ccacgggggc 
cactgaagct 
catggccaag 
caagaagcgg 
gtgaccccaa 
gctggggtcg 
tggaaaagca 



gccgcagcca 60 

aaggaggtga 12 0 

gaggacctca 180 

ggtctgttcc 240 

tacttcctga 3 00 

gtgctcaaga 3 60 

tgcctgctga 42 0 

ttggagaact 480 

gccggcgggc 540 

tcctccaccg 600 

aagcatgctg 6 60 

gcgctgcggg 72 0 

cctctcctgt 780 

gggatg-ggtac 840 
890 



<210> 248 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_0145Ol 



<400> 248 

acacgtactg ctgaccatca agtgcctgct gatccaccct aaccccgagt ctgcactcaa 



60 



<210> 249 
<211> 1182 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_016095 



<400> 249 

gcggccggcg 

cctgcagctc 

gttaccatta 

ccttttaacc 

agacagaaat 

agggatcatg 

cttacgaagc 

accctggtca 

tttgtgagac 

accagcggga 



gcgtctcctc 
gcgccgccat 
tccccaactt 
ctggtttacc 
gtcgcctgct 
aacgaaagga 
tcctgttaaa 
agg-a.tatgtg 
agcaggaggc 
ctttcctcac 



ccgggacgct 
ggacgctgcc 
cagtctggac 
cgtggaagtg 
ccctccagag 
agaaactttt 
tcatgcttca 
ggacactcgt 
acatgccaag 
acaagcgctc 



gaggggcccg 
gaggtcgaat 
aagatctacc 
cccctgtggc 
tggatggatg 
accccaatgc 
gacaacatcc 
atagccaaac 
ctggataact 
aaccacatgt 



aggagaccgt 
tcctcgccga 
tcatcggggg 
tggcgattaa 
tagaaaagtt 
ccagccctta 
cgaaggcaga 
tccgagtgtc 
tgaccttgat 
acaaactccg 



gagg-ctctgg 6 0 

gaacjgagctg 12 0 

ggaoctgggg 180 

cctgaaacaa 240 

ggagaagatg 3 00 

ctacatggaa 3 60 

cgaaatccgg 42 0 

tgctgacagc 480 

ggagatcaac 540 

cacgaacctc 600 
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cagcctctgg agagtactca gtctcaggac ttctagagaa aggcctggtg cag-g-cggctt 660 

gctgggggat gtgagcgctc aggatgtgat gaggtactcg tggttctgga gctctagaaa 72 0 

cacttctgat gcatgaaaaa tgtgtgatgg tgcaaggaat ggattcagga tgttgttgga 7 80 

gaaacaagtt tgtgattagt ccttaaaact tagctccctg ggacattctt caattccaca 840 

tctgtttcta gaaaccagcc ctttttcccc ccacttttga gaaataaaaa agccttaggt 900 

aaataagtca ttctccctag cagagccact tgggtctcct gcatggaagc cgtcacactt 960 

gggcaggtgt tcagtgactg gtaggtgtag atacagcagg agtggccatg tggtccacgg 102 0 

ctttttaccc cttcttgatc ctgatttctt gggctgaatt tagactctct cacagaggtg 10 80 

gctcacagag aaggatggca gatggtgcag ccaacaatgc tgaccggtgc ttatcctcta 1140 
agccctgatc cacaataaaa atggacccaa ctcaaaaaaa aa 1182 

<210> 250 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM__016095 
<400> 250 

atggattcag gatgttgttg gagaaacaag tttgtgatta gtccttaaaa cttagctccc 6 0 

<210> 251 
<211> 704 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_016185 
<400> 251 

tgcagcggtg gtcgg-ctgtt gggtgtggag tttcccagcg cccctcgggt ccgacccttt 6 0 

gagcgttctg ctccgrcrcgcc agcctacctc gctcctcggc gccatgacca caaccaccac 12 0 

cttcaaggga gtcgacccca acagcaggaa tagctcccga gttttgcggc ctccaggtgg 180 

tggatccaat ttttcattag gttttgatga accaacagaa caacctgtga ggaagaacaa 240 

aatggcctct aatatctttg ggacacctga agaaaatcaa gcttcttggg ccaagtcagc 3 00 

aggtgccaag tctagrtggtg gcagggaaga cttggagtca tctggactgc agagaaggaa 3 60 

ctcctctgaa gcaagctccg gagacttctt agate tgaag ggagaaggtg atattcatga 420 

aaatgtggac acagacttgc caggcagcct ggggcagagt gaagagaagc ccgtgcctgc 480 

tgcgcctgtg cccagcccgg tggccccggc cccagtgcca tccagaagaa atccccctgg 540 

cggcaagtcc agcctcgtct tgggttagct ctgactgtcc tgaacgctgt cgttctgtct 600 

gtttcctcca tgcttgagaa ctgcacaact tgagectgae tgtacatctt cttcpgatttg 660 
tttcattaaa aagaagcact ttatgtaaaa aaaaaaaaaa aaaa 7 04 

<210> 252 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_016185 
<400> 252 

tgaaccaaca gaacaacctg tgaggaagaa caaaatggee tctaatatct ttgggacacc 60 

<210> 253 
<211> 2268 
<212> DNA 

<213> Homo sapiens 
<220> 
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<221> Modified_ba.se 
<222> 1 . . . 2268 
<223> n = a,c,g, or t 

<300> 

<308> NM„016359 
<400> 253 

gggatttgaa ccncgctgac gaagtttggt gatccatctt ccgagtatcg ocgggatttc 60 

gaatcgcgat gatcatcccc tctctagagg agctggactc cctcaagtac eigtgacctgc 12 0 

agaacttagc caagagtctg ggtctccggg ccaacctgag ggcaaccaag ttgttaaaag 180 

ccttgaaagg ctacattaaa catgaggcaa gaaaaggaaa tgagaatcag g;atgaaagtc 2 40 

aaacttctgc atcctcttgt gatgagactg agatacagat cagcaaccag gaagaagctg 3 00 

agagacagcc acttggccat gtcaccaaaa caaggagaag gtgcaagact g-tccgtgtgg 3 60 

accctgactc acagcagaat cattcagaga taaaaataag taatcccact g-aattccaga 42 0 

atcatgaaaa gcaggaaagc caggatctca gagctactgc aaaagttcct: tctccaccag 480 

acgagcacca agaagctgag aatgctgttt cctcaggtaa cagagattca aaggtacctt 540 

cagaaggaaa gaaatctctc tacacagatg agtcatccaa acctggaaaa aataaaagaa 600 

ctgcaatcac tactccaaac tttaagaagc ttcatgaagc tcattttaag cjaaatggagt 6 60 

ccattgatca atatattgag agaaaaagaa acattttgaa gaacacaatt ccatgaatga 72 0 

actgaagcag cagcccatca ataagggagg ggtcaggact ccagtacctc caagaggaag 7 80 

actctctgtg gcttctactc ccatcagcca acgacgctcg caaggccggt cttgtggccc 840 

tgcaagtcag agtaccttgg gtctgaaggg gtcactcaag cgctctgcta tctctgcagc 9 00 

taaaacgggt gtcag^tttt cagctgctac taaagataat gagcataagc Qttcactgac 960 

caagactcca gccagaaagt ctgcacatgt gaccgtgtct gggggcaccc caaaaggcga 102 0 

ggctgtgctt gggacacaca aattaaagac catcacgggg aattctgctg ctgttattac 10 80 

cccattcaag ttgacaiactg aggcaacgca gactccagtc tccaataaga aaccagtgtt 1140 

tgatcttaaa gcaagtttgt ctcgtcccct caactatgaa ccacacaaag gaaagctaaa 12 00 

accatggggg caatctaaag aaaataatta tctaaatcaa catgtcaaca Qaattaactt 12 60 

ctacaagaaa acttacaaac aaccccatct ccagacaaag gaagagcaac Qgaagaaacg 132 0 

cgagcaagaa cgaaaggaga agaaagcaaa ggttttggga atgcgaaggg g-cctcatttt 13 80 

ggctgaagat taataatttt ttaatatctt gtaaatattc ctgtattctc aacttttttc 1440 

cttttgtaaa tttttttttt tttgctgtca tccccacttt agtcacgaga tctttttctg 1500 

ctaactgttc atagtctgtg tagtgtccat gggttcttca tgtgctatga tctctgaaaa 1560 

gacgttatca ccttaaiagct caaattcttt gggatggttt ttacttaagt ccattaacaa 162 0 

ttcaggtttc taacgagacc catcctaaaa ttctgtttct agatttttaa tgtcaagttc 1680 

ccaagttccc cctgctggtt ctaatattaa cagaactgca gtcttctgct agccaatagc 1740 

atttacctga tggcagctag ttatgcaagc ttcaggagaa tttgaacaat aacaagaata 1800 

gggtaagctg ggatagaaag gccacctctt cactctctat agaatatagt siacctttatg 1860 

aaacggggcc atatagtttg gttatgacat caatatttta cctaggtgaa sittgtttagg 192 0 

cttatgtacc ttcgttcaaa tatcctcatg taattgccat ctgtcactca ctatattcac 19 80 

aaaaataaaa ctctacaact cattctaaca ttgcttactt aaaagctaca tagccctatc 2040 

gaaatgcgag gattaatgct ttaatgcttt tagagacagg gtctcactgt g-ttgcccagg 2100 

ctggtctcaa actccaccaa atgtacttct tattcatttt atggaaaaga ctaggctttg 2160 

cttagtatca tgtccsLtgtt tccttcacct cagtggagct tctgagtttt atactgctca 222 0 
agatcgtcat aaataaaatt ttttctcatt gtcaaaaaaa aaaaaaaa 22 68 

<210> 254 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_016359 
<400> 254 

acattgctta cttaaaiagct acatagccct atcgaaatgc gaggattaat g-ctttaatgc 60 

<210> 255 
<211> 1590 
<212> DNA 
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<300> 

<308> NQVI_016816 



<400> 2 55 

gaggcag-ttc 

aaatctctgg 

atcgaccatg 

tcctaccctg 

ctcagag-g-cc 

gatcagttaa 

caaagagaga 

cgtgcgctca 

ctgcctgcct 

tatgtcasigc 

ttcacagaac 

cgcctagtca 

tatgccctgg 

aacacagccc 

atctactcjga 

cagctcacga 

ggtggaga.cc 

ccatgcttta 

aacagtacag 

acacatgagt 

caggcagaag 

ggctccag-tg 

gacaaagctc 

tggccttcta 

acctattctc 

ttccagcctt 

taacaataaa 



tgttgccact 

acaagttcat 

ccattgacat 

tgtgtgtgtc 

gatctgacgc 

atcgccgggg 

gagcactttc 

gcttcgtact 

ttgatgccct 

tcatcgagga 

tacagagaga 

agcactggta 

agctcctgac 

aaggatttcg 

caaagtatta 

aacccaggcc 

caaagggttg 

agaattggga 

acgatgagac 

accctcattt 

aggactggac 

ttatctggac 

ctcagtgagc 

tgccctctat 

tgaaaatatt 

gactttcttc 

aataaagcaa 



ctctctcctg 

tgaagactat 

catctgtggg 

caaggtggta 

tgacctggtt 

agagttcatc 

cgtgaagttt 

gagttcgctc 

gggtcagttg 

gtgcaccgac 

cttcctgaag 

ccaaaattgt 

ggtctatgct 

gacggtcttg 

tgactttaaa 

tgtgatcctg 

gaggcagctg 

tgggtcccca 

cgacgatccc 

ctctcataga 

ctgcaccatc 

cagttccttc 

tggtgtataa 

cctatcatag 

ccctgagaga 

tgtgcacctg 

ataccaaaaa 



tcaatgatgg 
ctcttgccag 
ttcctgaagg 
aagggtggct 
gtcttcctca 
caggaaatta 
gaggtccagg 
cagctcgggg 
actggcagct 
ctgcagaaag 
cagcgcccca 
aagaagaagc 
tgggagcgag 
gaattagtca 
aaccccatta 
gacccggcgg 
gcacaagagg 
gtgagctcct 
aggacgtatc 
cccagcacgc 
ctctgaatgc 
attttcaggt 
tccaagacag 
ataacattct 
gaacagagag 
atgggagggt 
1590 



atctcagaaa 

acacgtgttt 

aaaggtgctt 

cctcaggcaa 

gtcctctcac 

ggagacagct 

ctccacgctg 

agggggtgga 

ataaacctaa 

agggcgagtt 

ccaagctcaa 

ttgggaagct 

ggagcatgaa 

taaactacca 

ttgaaaagta 

accctacagg 

ctgaggcctg 

ggattctgct 

agaaatatgg 

tccaggcagc 

cagtgcatct 

gggactcttg 

aacccaagtc 

ccacagcctc 

atttagataa 

aatgtctaat 



taccccagcc 60 

ccgcatgcaa 12 0 

ccgaggtagc 180 

gggcaccacc 240 

cacttttcag 300 

ggaagcctgt 3 60 

gggcaacccc 42 0 

gttcgatgtg 480 

cccccaaatc 540 

ctccacctgc 600 

gagcctcatc 660 

gccacctcag 720 

aacacatttc 780 

gcaactctgc 840 

cctgagaagg 9 00 

aaacttgggt 9 60 

gctgaattac 1020 

ggctgaaagc 1080 

-ttacattgga 1140 

atccacccca 1200 

"t gggggaaag 12 60 

atccagagaa 132 0 

tcctgactcc 1380 

acttcattcc 1440 

gagaatgaaa 15 0 0 

gtattatcaa 1560 



<210> 25 6 

<211> 60 

<212> DKTA 

<213> Hoxno sapiens 

<300> 

<308> NM_016816 



<400> 25 6 

cgatcccagg acgtatcaga aatatggtta cattggaaca catgagtacc ctcatttctc 60 

<210> 257 

<211> 29 G5 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> NM_016817 



<400> 25V 

cggcagccag ctgagagcaa tgggaaatgg 

gaagctggcgt tggtttatcc aggaatacct 

cgacgagatg gtgaacacca tctgtgacgt 

gcagggagtg gccataggtg gctcctatgg 

tggtaccctt gtccttttct tcagtgactt 

acgtgacatc ctcgataaaa ctggggataa 



ggagtcccag ctgtcctcgg tgcctgctca 60 

gaagccctac gaagaatgtc agacactgat 12 0 

ctgcaggaac cccgaacagt tccccctggt 180 

acggaaaaca gtcttaagag gcaactccga 240 

aaaacaattc caggatcaga agagaagcca 3 00 

gctgaagttc tgtctgttca cgraagtggtt 3 60 
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gaaaaacaat 
aaatcagaga 
tcccagcccc 
tggtgagttt 
aaaactaaag 
caaggattta 
acaggggtgc 
gatcaaatgc 
gaccatcagg 
agttgaccca 
agctcaaacc 
tgtcctgcct 
gtttctccag 



ttcgagatcc 
atctctttcg 
tggatctatc 
gcagtctgct 
gatttgatcc 
ccctcgctgt 
agaaaagaca 
caggagaagc 
aacatcctgc 
accaataatg 
fcggttgactt 
gcaccactct 
cccaacaaat 



agaagtccct 
aggtgctggc 
gagagctcaa 
tcactgaact 
tcttgataaa 
ctccgtatgc 
actttgacat 
tgtgtatcta 
tgcaccagct 
tgagtggaga 
ctcccaacct 
tcacgacccc 
gcttcctaga 



tgatgggtcc 
cgccttcaac 
aagatccttg 
ccagcagaag 
gcactggcat 
cctggagctg 
tgctgaaggc 
ttggatggtc 
ccaatcagcg 
taaaatatgc 
ggataatgag 
aggccacctt 
gcagattgac 



accatccagg 
gctctgagct 
gataagacaa 
ttttttgaca 
caacagtgcc 
cttacggtgt 
gtcagaacgg 
aactacaact 
aggccagtaa 
tggcaatggc 
ttacctgcac 
ctggataagt 
agtgctgtta 



tgttcacaaa 
taaatgataa 
atgccagtcc 
accgtcctgg 
agaaaaaaat 
atgcctggga 
ttctggagct 
ttgaagatga 
tcttggatcc 
tgaaaaaaga 
catcttggaa 
tcatcaagga 
acatcatccg 



tacattcctt 

aggatcaacc 

ccataactca 

aatccatgaa 

tgagcctccc 

caacgaaagt 

tggctccaca 

cctcccggga 

ccggcccacc 

aaggaaactg 

ctatgcctgg 

agtcctggag 

ctttgaagat 

gatcttggac 

tcttctggac 

aaacccaata 

gatccatcct 

ttctaaaaga 

agaaagagtc 

tcttggactt 

ctcctctctg 

ttctgtcaac 

cagtctaaaa 

ctcataactc 

tctgtataca 

gccaaccctt 

tttatatgag 

tcctcttgcc 

taggtgcttt 

gtgatacaat 



aaagaaaact 

gccaaaggca 

cttaaaagct 

cagctgaaag 

aagtggaagg 

g-tcagctttg 

c ccagccccg 

g-g-agagtttt 

aaactaaagg 

aagccaaagg 

gagcagggga 

ctggtcacac 

gagaccgtga 

ccaggcgaac 

aaagaagcaa 

ccaccttgga 

attgtcaatg 

aacttctgga 

actcacatcc 

cttgaaatca 

caggagccct 

tfcgtattttt 

aaggaatcct 

tg-tgatcttg 

aaaccctttc 

tcattagaac 

gotgttgtct 

aaagcaaaag 

aacctcacat 

aaagaatgta 



gcttccgaca 

cagctctgaa 

acacctccca 

ccttttggag 

ctcccagggt 

atgtgcttcc 

aggtttatgc 

ctacctgttt 

atttaattcg 

ggtctttgcc 

gtggagtgcc 

aatatcagca 

ggaagtttct 

ccacaggtga 

aggttaggtt 

aagtgccgac 

agatgttctc 

gatcatctgg 

attcttccct 

atcaagactg 

gcttaaaata 

ttttcttgta 

ctgtgtcttc 

ctctcggtgc 

tgcccctgct 

ttcaagctct 

tttccttctg 

actttttcct 

accctcactt 

tctgg 2905 



atcaacagcc 

gactggctct 

aaaaaacgag 

ggagaaggag 

gctgagcttc 

tgcctttaat 

agggctcatt 

cacagtcctg 

cctggtgaag 

cccaaagtat 

ggattttgac 

gctcggcatc 

actgagccag 

cgtgggtgga 

atcctctccc 

aatgcagaca 

atccagaagc 

caatcgcttt 

tgatggtccc 

caaacccttt 

gttgatgtca 

tttttccaat 

aaagcaaagc 

ttccaactca 

gacacagaca 

ccaaaggctc 

agcctgcctt 

tggctttagc 

aaacttttat 



aagatccaga 

gatgccgatc 

cggcacaaaa 

gaggagcttg 

tctctgaaat 

gcactgggtc 

gatctgtata 

cagcgaaact 

cactggtaca 

gccttggagc 

actgcagaag 

ttctggaagg 

ttgcagaaaa 

ggggaccgtt 

tgcttcaagg 

ccaggaagtt 

catagaatcc 

taaagactcg 

tattcctcct 

cataaagctg 

tcactttatg 

tagctcctcc 

tctttacttt 

tccacgtcct 

tcctctatgc 

agattataac 

tatcccccca 

cttaaagata 

cactgttgca 



-ttgtccgggg 

-fccgtcgtgtt 

tcgtcaagga 

aagtcagctt 

ccaaagtcct 

agctgagttc 

aatcctcgga 

tcattcgctc 

sagagtgtga 

tgctcaccat 

cgtttccggac 

tcaattacaa 

ccaggcctgt 

g-gtgttggca 

aitgggactgg 

g-tggagctag 

tgaataataa 

gctcaccgtg 

tcccttgcct 

ccttgctgaa 

tgcatcttat 

tttttccttc 

ccccttggtt 

g-tctgtttcc 

cagcagccag 

tgttgtcata 

cccaggagta 

c ttgaaggtc 

tataccagtt 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 



<210> 258 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_016817 
<400> 258 

aaggtctagg tgctttaacc tcacataccc tcacttaaac ttttatcact gttgcatata 60 

<210> 259 
<211> 2054 
<212> DNA 

<213> Homo sapiens 
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<300> 

<308> NM_017 414 



<400> 259 

gggaagctcg 

cgctgtaagt 

cggggccaga 

gtcccgacgt 

tcgtgcctgg 

caggcagctg 

tgaggcaaat 

aaaagaagga 

gggactaccc 

ccttgattca 

tgcccagggg 

agaagatgca 

agaagtgcaa 

ggaacctgat 

atacgatccg 

acagcagcat 

cactggagga 

gcttctgtga 

tgccccagac 

agatctgcca 

agcgagagtc 

ttgcgcacgt 

atggaaaatg 

agtgtaccta 

tgaagatgga 

ccatttccgt 

gttttcaaac 

catgaggccc 

tgtggctgct 

cctgtgggaa 

gccaaaggtc 

tactggctga 

tatgaatcaa 

cttctccata 

aaaaaaaaaa 



ggccggcagg 
ttcgctttcc 
ocaaggcggg 
ggaactcagc 
o tcacataag 
cg-gcctgggg 
ctgtcagtcc 
agaagacagc 
tcatggcctg 
g-gtgttcgta 
agctgacgag 
ggacagccgg 
cgtgcccttg 
taaggaccag 
ggtgaaggac 
gctcaccctc 
cgccctgcac 
Q-a.actgtggg 
cctgacaatc 
ctccctgtac 
t tgtgatgct 
gggaatggca 
cj-ttctgcttc 
cggaaatcct 
cj*tgctaatgg 
tcctggatct 
tatataactg 
o tcaggtcct 
cggtcctggg 
cttcaggggt 
a_cjtggcaggg 
atatcagtgc 
grtgttttgta 
a_gatagtgtg 
aaaa 2054 



gtttccccgc 
attcagtgga 
cccggagcgg 
agcggaggct 
cgcttcctgg 
gttttggagt 
atcctggctg 
aacatgaaga 
gttggtttac 
atgaatgtgg 
cagaggagaa 
cagaaagcag 
tttgtccaac 
atcactgatg 
tccttgattt 
ccactttctc 
tgcttcttcc 
aagaagaccc 
cacctcatgc 
ttcccccaga 
gaggagcagt 
gactccggtc 
aatgactcca 
aactaccact 
aaatgcccaa 
acggagtctt 
agccttattt 
gatcagtcag 
tgctcgctgc 
tcccagtggg 
ggtatttcag 
tgtttgtaat 
actgctattc 
ataaacacag 



acgctggcgc 
aaacgaaagc 
aacttcggtc 
ggacgcttgc 
aagtgaagtc 
gatcacgaat 
agtcctcgca 
gagagcagcc 
acaacattgg 
acttcaccag 
gcgtcccttt 
tgcggcccct 
atgatgctgc 
tgcacttggt 
gcgttgactg 
tttttgatgt 
agcccaggga 
gtgggaaaca 
gattctccat 
gcttggattt 
ctggagggca 
attactgtgt 
atatttgctt 
ggcaggaaac 
aaccttcaga 
ctaagagatt 
ataattaggg 
aatggatgct 
tgtgcaagac 
gagagcagtg 
tattatacaa 
ttttcacttt 
atttattcag 
tcatgaataa 



ccagctcccg 
tgggcggg-gt 
ccagctcggt 
atggcgcttg 
gtgctgtcct 
gagcaaggcg 
gtccccggca 
cagagagogt 
acagacctgc 
gatattgatag 
ccagatgctt 
ggagctgg-cc 
ccaactgtac 
ggagagactg 
tgccatgcfag 
ggactcaaag 
gttatcaaLgc 
ggtcttgaLag 
caggaattca 
cagccagatc 
gtatgagctt 
ctacatccgg 
ggtgtccfcgg 
tgcatatctt 
gattgacacg 
ttgcaatg-ag 
atattatcaa 
ttcaccagca 
attagccctt 
gcagtggcjag 
ctgctgtcj-ac 
gagaaccaac 
caaatattta 
agttattttc 



gcgcggaggc 
gccacgagcg 
ccccggctca 
agagattcca 
gaacgcgggc 
tttgggctcc 
gatcttgaag 
cccagggcct 
tgccttaact 
aggatcacgg 
ctgctgctgg 
tactgcctgc 
ctcaaactct 
caggccctgt 
agtagcagaa 
cccctgaaga 
aaaagcaagt 
ctgacccatt 
cagacgagaa 
cttccaatga 
tttgctgtga 
aatgctgtgg 
gaagacatcc 
ctggtttaca 
ctgtcatttt 
gagaagcatt 
aatatgtaac 
gacccggcca 
tagttatgag 
gcatctgggg 
cagacttgta 
attaattcca 
ttgatcatct 
cacaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 



<210> 260 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_017 414 
<400> 260 

tgagcatctc ttctccataa gatagtgtga taaacacggt catgaataaa gttattttcc 

<210> 261 
<211> 3638 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM 017 523 



60 



<400> 261 

ggtagatgcg gctgtgacag cagcaaagaa tgacggccaa gggcgaca_g-c aggggctggc 



60 
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catgctgtaa sggggcttct tgggagggtc cagcctcagg aatcaagggg aactcctgag 12 0 
ccgagaattc tgaagatctc ctccctccct gaagctgtgg gctgggccat cggaaaactt 180 
tcagttttgt ttccttgcct gcaagaaacg aaactcaacc gaaagcctgc agagagcaga 240 
acatggaagg agacttctcg gtgtgcagga actgtaaaag acatgtagtc tctgccaact 3 00 
tcaccctcca tgaggcttac tgcctgcggt tcctggtcct gtgtccggag tgtgaggagc 3 60 
ctgtccccaa ggaaaccatg gaggagcact gcaagcttga gcaccagcag gttgggtgta 42 0 
cgatgtgtca gcagagcatg cagaagtcct cgctggagtt tcataaggcc aatgagtgcc 480 
aggagcgccc tgttgagtgt aagttctgca aactggacat gcagctcagc aagctggagc 540 
tccacgagtc ctactgtggc agccggacag agctctgcca aggctgtcjgc cagttcatca 600 
tgcaccgcat gctcgcccag cacagagatg tctgtcgcag tgaacagcjcc cagctcggga 660 
aaggggaaag aatttcagct cctgaaaggg aaatctactg tcattattgc aaccaaatga 72 0 
ttccagaaaa taagtatttc caccatatgg gtaaatgttg tccagactca gagtttaaga 780 
aacactttcc tgttggaaat ccagaaattc ttccttcatc tcttcca^gt caagctgctg 840 
aaaatcaaac ttccacgatg gagaaagatg ttcgtccaaa gacaagaagt ataaacagat 900 
ttcctcttca ttctgaaagt tcatcaaaga aagcaccaag aagcaaaaac aaaaccttgg 960 
atccactttt gatgtcagag cccaagccca ggaccagctc ccctagagga gataaagcag 102 0 
cctatgacat tctgaggaga tgttctcagt gtggcatcct gcttcccctg ccgatcctaa 1080 
atcaacatca ggagaaatgc cggtggttag cttcatcaaa aggaaaacaa gtgagaaatt 1140 
tcagctagat ttggaaaagg aaaggtacta caaattcaaa agatttcact tttaacactg 12 00 
gcattcctgc ctacttgctg tggtggtctt gtgaaaggtg atgggtttta ttcgttgggc 1260 
tttaaaagaa etaggtttggc agaactaaaa acaaaactca cgtatcatct caatagatac 132 0 
agaaaaggct tttgataaaa ttcaacttga cttcatgtta aaaaccctca acaaaccagg 13 80 
cgtcgaagga acatacctca aaataataag agccatctat gacaaaacca cagccaacat 1440 
catactgaat gagcaaaagc tggagcatta ctcttgagaa gtagaacaag gcacttcagt 15 0 0 
cctattcaac atagtactgg aagtcctcgc cacagcaatc aggcaagaga aagaaataaa 1560 
aggcaaccaa aaagaaagga agtcgaagta tctctgtttg cagacgatat gattctatat 1620 
ctagaaaacc ccatgatctt ggcccaaaag ctcctagatc tgataaacaa cttcagctaa 1680 
ctttcaggag acaaaatcaa tatacaaaat atggtagcat ttttatacac caacgacatc 1740 
caagctgaga gccaaatcaa gaatgcaatc ctattcacaa ttgccacaaa aagaataaaa 1800 
tacctaggaa tacagctaac cagggagatg aaagatctct acaacaaaaa ttacaaaaca 1860 
ctgctgaaag aaatcagaga tgacacaaat ggaaaaacat tccatactta tggataggaa 192 0 
gaatcaatat tgttaaaatg gccatactac ccaaagcaat ttatagattc aatgctattc 1980 
ctatcaaact accaataaca ttcttcacag aatcagaaaa aaaaagcatt aaaatttatt 2040 
tgaaaccaaa aaagagccca aaaagccaaa gcaatcctaa gcaaaaagaa caaagctgga 2100 
ggcatcgcat tacccaactt caaactatac tacagggcta cagtaaccaa aactgcatga 2160 
tactggtaca aaagcatggt gctggtacaa aagcagacac atagatcaat ggaacagaat 222 0 
agagggccca gaaataaagc tacacaccta caaccatcta atctttgaca aagttgacaa 22 80 
aaatacgcaa tggggaaaga attccccatt cagtaagtgg tactgggata actagctagc 2340 
catatgcaga ggattgaaac tgaaccactt ccttacacca tatgcaaaaa tcaactcaag 2400 
atggattaaa gacttaaatg taaaacccca aactataaaa actctggasig ataacctagg 2 460 
caataccatt ctggacatag gaacggaaaa agatttcatg acaaagatcc caaaaataat 2520 
tgtaacgaaa gcaaaaattg acaaatggga catgattaaa cagaattacc atttgactca 2 5 80 
gcaatcccat tattggttat atacccaaag gaatctaaat cattctgtca taaagacata 2640 
tatacacaaa tgttcacggc agcactatac acaatcgcaa agtcagggaa tcaaactaaa 2700 
tgtccatcag tcjgtagaaag gataaagaaa atgtggtggc agggagtgg-t ggctcatgtc 2760 
tgtaatccca gcactttggg aggctgaggc gggtggttca cctgaggtca ggagtttgag 2 82 0 
accagcctgg ccaacatggc gaaactccgt ctccgctaaa aatacgaaaa ttagccaggc 2 880 
gtggtggcga gcacctgtca tcccagctac ttgggaggcc taggcgtgag aatcgcttga 2940 
acctggaagg tcrgtggttgc agtgagccga gatcctgcca ctgcactcca gcctgggcaa 3 000 
ccaagcgaga ctctgcctta aaaaaaaaaa aaagaaaatg tggcacatat acaccatgga 3 060 
atactatgca gccataaaaa agaatgggat catgtcctgt gcagcaacgt ggatggagct 312 0 
ggaagccatt atcctaaatg aactcactca gaaacagaaa accaaatacc acatgttctc 3180 
acttataagt agaagctaaa cattgagtac acatggatac aaagaaggga accgcagaca 3240 
ctggggccta cctgaggtcg gagcatggaa ggagggtgag gatcaaaaaa ctacctatct 3300 
ggtactatgc tttttatctg gatgatgaaa taatctgtac aacaaaccot ggtgacatgc 33 60 
aatttaccta tatagcaagc ctacacatgt gcccctgaac ctaaaaaaaa agttaaaaga 342 0 
aaaacgtttg gattattttc cctctttcga acaaagacat tggtttgccc aaggactaca 3480 
aataaaccaa cgggaaaaaa gaaaggttcc agttttgtct gaaaattctg attaagcctc 3540 
tgggccctac agcctggaga acctggagaa tcctacaccc acagaacccg gctttgtccc 3600 
caaagaataa aaacacctct ctaaaaaaaa aaaaaaaa 3 63 8 
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<210> 262 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_017523 
<400> 262 

ttggaaaagg aaaggtacta caaattcaaa agatttcact tttaacactg gcattcctgc 60 

<210> 263 
<211> 2461 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM„018410 
<400> 263 

atgctgggta cgctgcgcgc catggagggc gaggacgtgg aagacgacca gctgctgcag 60 

aagctcaggg ccagtcgccg ccgcttccag aggcgcatgc agcggctgat agagaagtac 12 0 

aaccagccct tcgaggacac cccggtggtg caaatggcca cgctgaccta cgagacgcca 180 

cagggattga g-aatttgggg tggaagacta ataaaggaaa gaaacaaagg agagatccag 2 40 

gactcctcca tgaagcccgc ggacaggaca gatggctccg tgcaagctgc agcctggggt 3 00 

cctgagcttc cctcgcaccg cacagtcctg ggagccgatt caaaaagcgg tgaggtcgat 3 60 

gccacgtcag accaggaaga gtcagttgct tgggccttag cacctgcagt gcctcaaagc 42 0 

cctttgaaaa atgaattaag aaggaaatac ttgacccaag tggatatact gctacaaggt 480 

gcagagtatt ttgagtgtgc aggtaacaga gctggaaggg atgtaccjtgt gactccgctg 540 

ccttcactgg cctcacctgc cgtgcctgcc cccggatact gcagtccj-tat ctccggaaag 600 

agtcctggtg acccagcgaa accagcttca tctcccagag aatgggatcc tttgcatcct 660 

tcctccacag acatggcctt agtacctaga aatgacagcc tctccctaca agagaccagt 72 0 

agcagcagct tcttaagcag ccagcccttt gaagatgatg acatttcj-caa tgtgaccatc 7 80 

agtgacctgt acgcagggat gctgcactcc atgagccggc tgttgacjcac aaagccatca 840 

agcatcatct ccaccaaaac gttcatcatg caaaactgga actgcag-gag gaggcacaga 900 

tataagagca ggatgaacaa aacatattgc aaaggagcca gacgttctca gaggagctcc 960 

aaggagaact tcataccctg ctctgagcct gtgaaaggga caggggcatt aagagattgc 102 0 

aagaacgtat tagatgtttc ttgccgtaag acaggtttaa aattggaaaa agcttttctt 1080 

gaagtcaaca gaccccaaat ccataagtta gatccaagtt ggaaggagcg caaagtgaca 1140 

ccctcgaagt attcttcctt gatttacttc gactccagtg caacatataa tcttgatgag 12 00 

gaaaatagat ttaggacatt aaaatggtta atttctcctg taaaaatagt ttccagacca 12 60 

acaatacgac agggccatgg agagaaccgt cagagggaga ttgaaat ccg atttgatcag 132 0 

cttcatcggg aatattgcct gagtcccagg aaccagcctc gccggatg-tg cctcccggac 13 80 

tcctgggcca tgaacatgta cagagggggt cctgcgagtc ctggtgg-cct tcagggctta 1440 

gaaacccgca ggctgagttt accttccagc aaagcaaaag caaaaag-ttt aagtgaggct 1500 

tttgaaaacc taggcaaaag atctctggaa gcaggtaggt gcctgcccaa gagcgattca 15 60 

tcttcatcac ttccaaagac caaccccaca cacagcgcaa ctcgcccgca gcagacatct 162 0 

gaccttcacg ttcagggaaa tagttctgga atatttagaa agtcagt^tc acccagcaaa 1680 

actctttcag tcccagataa agaagtgcca ggccacggaa ggaatcg"tta cgatgaaatt 1740 

aaagaagaat ttgacaagct tcatcaaaag tattgcctca aatctcc-tgg gcagatgaca 1800 

gtgcctttat gtattggagt gtctacagat aaagcaagta tggaagt "teg atatcaaaca 1860 

gaaggcttct taggaaaatt aaatccagac cctcacttcc agggtttcca gaagttgcca 192 0 
tcatcacccc t^gggtgcag aaaaagtcta ctgggctcaa ctgcaattga ggctccttca -1980 

tctacatgtg ttgctcgtgc catcacgagg gatggcacga gggaccatca gttccctgca 2 040 

aaaagaccca ggctatcaga accccagggc tccggacgcc agggcaa-ttc cctgggtgcc 2100 

tcagatgggg tggacaacac cgtcagaccg ggagaccagg gcagctcttc acagcccaac 2160 

tcagaagaga gaggagagaa cacgtcttac aggatggaag agaaaagtga tttcatgeta 2220 

gaaaaattgg aaactaaaag tgtgtagcta ggttatttcg gagtgttatt tatcttccca 22 80 

cttgctctct gtttgtattt ttgttttgtt tttgattctt gagactgtga ggacttggtt 2340 

gacttctctg cccttaaagt aaatattagt gaaattggtt ccatcagaga taacctcgag 2400 

ttcttggtgt agaaattatg tgaataaagt tgctcaatta gaaaaaaaaa aaaaaaaaaa 2460 
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a 2461 

<210> 264 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM__O18410 
<400> 264 

agtgatttca. tgctagaaaa attggaaact aaaagtgtgt agctaggttsi tttcggagtg 60 

<210> 265 
<211> 1405 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_018455 
<400> 265 

cacctcgctc gcagcctccc cagcgcagca gcccggctgt gggcctgcgcj cagccgggtc 60 

ttcctggtcc ccacctcctg gggccgacgg gcggcaggaa ggggctcggc gggacgcgcc 12 0 

gtcagggacc tgaggaggaa caacggaacg cgttcggaac ggcctggact cccgagactc 180 

acccgactcg tggccacacc gggagaactg aagcggcagt agccggcgga gacgcccgac 240 

ccgaaggccg gctgctaggg agcagacagc tgaaccgctt gccagacgcc gaaacccagt 3 00 

gacgccctcc accgctccac cgtgctcccg gctccccgcc cccgccgccc gcgggcccca 3 60 

aggcgcatgc gccgcctgtc ctggaggggc ccatttccgt ccgtcgtggg- gggaggcaca 42 0 

gtgagtccac tggggcacgg cagcgtctaa gccacaagcc gagcacataa gccaggtcct 480 

aacggagcct atgtgtaagt ccactactgg tgcaaggttg cacacttcta agaagagcgg 540 

cgtggggggc tcggcgacct tcgcttcagt cgctcccccg tgcagtcccc tgtgcccaag 600 

acacagcctg atgcttgtgc tccggtgggc ggagcttgga ggcggcggga. actgcaattg 660 

gtggctttga. aggcgcggcg agcgggaaca gctcttgagg agtgagactgr caggagatgt 72 0 

gggccgtgcc aaagagatgg atgagactgt tgctgagttc atcaagagga ccatcttgaa 7 80 

aatccccatg aatgaactga caacaatcct gaaggcctgg gattttttgt ctgaaaatca 840 

actgcagact gtaaatttcc gacagagaaa ggaatctgta gttcagcact tgatccatct 900 

gtgtgaggaeL aagcgtgcaa gtatcagtga tgctgccctg ttagacatca tttatatgca 960 

atttcatcag caccagaaag tttgggatgt ttttcagatg agtaaaggac caggtgaaga 102 0 

tgttgacctfc tttgatatga aacaatttaa aaattcgttc aagaaaattc ttcagagagc 1080 

attaaaaaat gtgacagtca gcttcagaga aactgaggag aatgcagtct ggattcgaat 1140 

tgcctgggga. acacagtaca caaagccaaa ccagtacaaa cctacctacg tggtgtacta 12 0 0 

ctcccagact ccgtacgcct tcacgtcctc ctccatgctg aggcgcaata caccgcttct 12 60 

gggtcaggag- ttagaagcta ctgggaaaat ctacctccga caagaggaga tcattttaga 132 0 

tattaccgaa atgaagaaag cttgcaatta gtgaacatga aaggaaaata aaaattcctc 13 80 
acagtcaaaa aaaaaaaaaa aaaaa 1405 

<210> 266 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> NM_018455 
<400> 266 

ccgacaagag gagatcattt tagatattac cgaaatgaag aaagcttgca attagtgaac 6 0 

<210> 267 
<211> 927 
<212> DNA 
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<213> Homo sapiens 
<300> 

<308> KTM_018465 
<400> 267 

ggcagcgggc gaaaggagcc ggggcctgga ggtttgcgta ccggtcgcct ggtcccggca 60 

ccagcgccgc ccagtgtggt ttcccataag gaagctcttc ttcctgcttg gcttccacct 12 0 

ttaacccttc cacctgggag cgtcctctaa cacattcaga ctacaagtcc agacccagga 180 

gagcaa-g-gcc cagaaagagg tcaaaatggg gtttatattt tcaaaatcta tgaatgaaag 240 

catgaaaaat caaaaggagt tcatgcttat gaatgctcga cttcagctg-g aaaggcagct 3 00 

catcatgcag agtgaaatga gggaaagaca aatggccatg cggattgccjt ggtctcggga 3 60 

attcctcaaa tattttggaa ctttttttgg ccttgcagcc atctctttaa cagctggagc 42 0 

gattaaaaaa aagaagccag ccttcctggt cccgattgtt ccattaagct ttatcctcac 480 

ctaccagtat gacttgggct atggaaccct tttagaaaga atgaaaggtg aagctgagga 540 

catactggaa acagaaaaga gtaaattgca gctgccaaga ggaatgatca cttttgaaag 600 

cattgaaaaa gccagaaagg aacagagtag attcttcata gacaaatgaa atcatgctta 660 

ccaatcaaat ctcaaagcac agaattattg acttgaatca tggtttttac agttttttaa 72 0 

atgctcaaga ttttgatatt atagatttta ttttaaaata ttaaaatgca agatagtttt 7 80 

gagctatttt aaaataaaat ttataacatt caacacaaaa tcatggagg-t gctctaaata 840 

acttttagat ttcctctctc tgtgtgcatt accaatatct aagtgtaaaa ttaataaatt 900 
gttttgeiatt cctggaaaaa aaaaaaa 927 

<210> 268 

<211> 60 

<212> 3DNA 

<213> Homo sapiens 

<300> 

<308> rsTM_018465 
<400> 268 

ggaaca^agt agattcttca tagacaaatg aaatcatgct taccaatcaa atctcaaagc 60 

<210> 2 69 

<211> 1047 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> NM_018487 
<400> 269 

cccacttctc cagccagcgc cccagccctc ccgccgcccg ctcgcaggtc ccgaggagcg 60 

cagactg-tgt ccctgacaat gggaacagcc gacagtgatg agatggcccc ggaggcccca 12 0 

cagcacaccc acatcgatgt gcacatccac caggagtctg ccctggccaa gctcctgctc 180 

acctgctgct ctgcgctgcg gccccgggcc acccaggcca ggggcagcag- ccggctgctg 240 

gtggcctcgt gggtgatgca gatcgtgctg gggatcttga gtgcagtcct aggaggattt 3 00 

ttctacatcc gcgactacac cctcctcgtc acctcgggag ctgccatctg gacaggggct 3 60 

gtggctgtgc tggctggagc tgctgccttc atttacgaga aacggggtgg- tacatactgg 420 

gccctgctga ggactctgct aacgctggca gctttctcca cagccatcgc tgccctcaaa 480 

ctttgga-atg aagatttccg atatggctac tcttattaca acagtgcctg- ccgcatctcc 540 

agctcga_gtg actggaacac tccagccccc actcagagtc cagaagaagt cagaaggcta 600 

cacctatgta cctccttcat ggacatgctg aaggccttgt tcagaaccct tcaggccatg 660 

ctcttgg-gtg tctggattct gctgcttctg gcatctctga cccctctgtg gctgtactgc 72 0 

tggagaatgt tcccaaccaa agggaaaaga gaccagaagg aaatgttggai agtgagtgga 780 

atctagccat gcctctcctg attattagtg cctggtgctt ctgcaccggg cgtccctgca 840 

tctgactgct ggaagaagaa ccagactgag gaaaagaggc tcttcaacag ccccagttat 900 

cctggcccca tgaccgtggc cacagccctg ctccagcagc acttgcccat tccttacacc 960 
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ccttccccat cctgctccgc ttcatgtccc ctcctgagta gtcatgtgat aataaactct 102 0 
catgttattg ttcccaggaa aaaaaaa 1047 

<210> 270 

<211> 6 0 

<212> DISTA 

<213> Homo sapiens 

<300> 

<308> NJyr_018487 
<400> 270 

aaccaaaggg aaaagagacc agaaggaaat gttggaagtg agtgcjasa.tct agccatgcct 60 

<210> 2 71 

<211> 2280 

<212> DISTA 

<213> Homo sapiens 

<300> 

<308> U17077 



<400> 271 

ccgcccgoca ccagctacgc cccgtccgac gtgccctcgg gggtcgcgct gttcctcacc 60 

atccctttcg ccttcttcct gcccgagctg atatttgggt tcttggtctg gaccatggta 120 

gccgccaocc acatagtata ccccttgctg caaggatggg tgatgta_tgt ctcgctcacc 180 

tcgtttctca tctccttgat gttcctgttg tcttacttgt ttggatttta caaaagattt 240 

gaatcctg-ga gagttctgga cagcctgtac cacgggacca ctggcatcct gtacatgagc 3 00 

gctgccgfc.cc tacaagtaca tgccacgatt gtttctgaga aactgctgga cccaagaatt 3 60 

tactacatta attcggcagc ctcgttcttc gccttcatcg ccacgctgct ctacattctc 420 

catgccttca gcatctatta ccactgatgc acaggcgcca ggccaag-ggg gaaatgctct 480 

ttgaaagctc caattattgg tccccaaaag cagcttccaa cgtttgccat ctggatgaca 540 

aacggaag-at ccactaaaac gtccacggga ttaacagaac gtccttg-cag actgagcgat 600 

gacaccacac tttgtttgga catttaaatt cactctgctg aataggagga agcttttctt 660 

tttcctgcjga aaacaactgt ctcttggaat tatctgacca tgaacttgct cttctagaca 72 0 

actcacatca aagccctcac tccactaatg gagaatccta gccccactaa tgccaagtct 780 

gtttgggcjat tttgcctcag ctatgggctt ccctagagta ggtctag-ggg aatactcagt 840 

ctgatctfctt ttttgtttgt tttattttgt tttttttgag acggagtctc gctcttcctc 900 

caaggctcjga gtgcagtgac gcgatctcca ctcactgcag gctccgcctc ccgggttccc 960 

gccattctcc tgcctcagcc tcccgagtag ccgggactac aggcgcccac caccatgccc 1020 

ggctaatfcta gttgtatttt tagtagagat ggggtttcac cgtattagcc aggatggtct 1080 

cgatctcctg acctcgtgat ccgcccgcct cggcctccca aagtgctg-gg attacaggcg 1140 

tgagccaccg tgcccggcct gattctctta aaattgaaga ggtgctgcca aggccttcag 12 00 

atctaacgca gatgcataga ccttgttcct ggtacttgtt cagcctgtgc tggggagccg 12 60 

tggtcccgrag ttccctggga ggctgacagg gtcaagccac cctgcccacc accctcccac 132 0 

ttcccctccc ctttcctctc cagcattagg attcaaggga aatctgcatg aagccaattt 13 80 

tgagggta_cja cgtgtgggga aaataaatca ttatacagta agacctgggg cttgaggggt 1440 

ggggaatcjcjg gagggaaggg catagcctgc tcctccatga gtctgacatc tcggaaactg 1500 

agcagctgj-cc ggacgcctgg gtcaggaatc caagacccca cctcttaagg actggttcct 15 60 

cagaaagcac cctcagggaa aaaggtgaaa acattacatc cgtggattct cctgccacaa 1620 

ccgcattg-g-a agaaaaggct gccgcaacat ctcagcgagg agtgaag^ac ccatgtccca 1680 

ggaaccgcgc tgcgccacct gcactcaccc ccctcacatt ctcttaagca cccggtggcc 1740 

ctccgagg-ct ggcggaatgg tggtgcccac ggggttgggc aagggctcac caggacctca 1800 

acgggcaa.<ag ttgtgcacac taaaatatca aatcaaggtg cttggtttta aagtaaatgt 1860 

ttttctaaag aaagctgtgt tcttctgttg acccagacga ataggcjcsLca gccctgtaac 192 0 

tgcacgtg-<cc ttctgtcatt gggaatgaaa taaattatta cgagaaaggg acttgtccta 1980 

actggttt^a ggccttacag ttttgtatct acatttttcc cctcctgorgg tttgcgggga 2 040 

cagggacaga actacaggag tcatgggaaa gaaaattctg gcttcactac tgctcactgc 2100 

tcactttc-bg atcactctga tacttttttt tttttttttt ttttgcasicc tgataccttg 2160 

aaaagcttct atgtgtctct ccttttgttg cctggcagct gtctaggatg atcactgatt 222 0 
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actatttact aagtagccac atgcaaataa aagttgtttg gtaaaatgga aaaaaaaaaa 22 80 



<210> 272 

<211> 60 

<212> D3STA 

<213> Homo sapiens 

<300> 

<308> U17077 



<400> 272 

tcaccaggac ctcaacgggc aaagttgtgc acactaaaat atcaaatcaa ggtgcttggt 

<210> 273 

<211> 2554 

<212> DlsTA 

<213> Homo sapiens 



60 



<300> 

<308> X87949 



<400> 273 

aggtcgacgc 

gagagggstag 

tgtgaccccg 

cctgtggctg 

gccgcgatgc 

gtgggcaogg 

aacggccgcg 

gccttcaotc 

aaccccgaga 

tctgtgcagc 

tacattcaag 

gccatggttc 

catgcagttg 

gctggaacta 

attgcttai-tg 

ggcggaacct 

actaatggag 

atcaaactgt 

aaactccggc 

gaaattgetgt 

gaagagctca 

gattctgeLtt 

attccaaaga 

ataaaccoag 

gatcaagata 

actgtagg-ag 

tctcagatct 

ggtgaaaQ-ac 

cctcctgctc 

attcttcg*ag 

aatgaccaga 

tttgctgstgg 

tatgcctatt 

tctgaagata 

caccaagatg 

caaccaatta 

acagcagaaa 

ggactcag-ga 

atcttcaoct 



cggccaagac 
cgccgcggcc 
ggcccctgcc 
gactgcctgc 
tgctgctgct 
tggtcggcat 
tggagatcat 
ctgaagggga 
acacggtctt 
aggacatcaa 
ttgatattgg 
tcactaaaat 
ttactgtacc 
ttgctggcct 
gcctggataa 
tcgatgtgtc 
atactcatct 
acaaaaagaa 
gcgaggtaga 
ccttctatga 
acatggatct 
tgaagaagtc 
ttcagcaact 
atgaagctgt 
caggtgacct 
gtgtcatgac 
tttctacagc 
ccctgacaaa 
ctcgtggggt 
tgacagctga 
atcgcctgac 
aagacaaaaa 
ctctaaagaa 
aggagaccat 
ctgacattga 
tcagcaaact 
aagatgagtt 
acttttgtta 
cagagtggag 



agcacagaca 
tgtatttcta 
gcctgcaagt 
tgctgcccaa 
cagcgcggcg 
cgacttgggg 
cgccaacgat 
acgtctgatt 
tgacgccaag 
gttcttgccg 
aggtgggcaa 
gaaagaaacc 
agcctatttt 
aaatgttatg 
gagggagggg 
tcttctcacc 
gggtggagaa 
gacgggcaaa 
aaaggccaag 
aggagaagac 
gttccggtct 
tgatattgat 
ggttaaagag 
agcgtatggt 
ggtactgctt 
caaactgatt 
ttctgataat 
agacaatcat 
cccacagatt 
agacaagggt 
acctgaagaa 
gctcaaggag 
tcagattgga 
ggaaaaagct 
agacttcaaa 
ctatggaagt 
gtagacactg 
ggaaaaaatt 
ttgaactgct 



gattgaccta 
gacctgccct 
cggaaattgc 
ctggctggca 
cgggccgagg 
accacctact 
cagggcaacc 
ggcgatgccg 
cggctcatcg 
ttcaaggtgg 
acaaagacat 
gctgaggctt 
aatgatgccc 
aggatcatca 
gagaagaaca 
attgacaatg 
gactttgacc 
gatgtcagga 
gccctgtctt 
ttttctgaga 
actatgaagc 
gaaattgttc 
ttcttcaatg 
gctgctgtcc 
catgtatgtc 
ccaagtaata 
caaccaactg 
cttctgggta 
gaagtcacct 
acagggaaca 
atcgaaagga 
cgcattgata 
gataaagaaa 
gtagaagaaa 
gctaagaaga 
gcaggccctc 
atctgctagt 
gaaagaactt 
atagcctaag 



ttggggtg-tt 
tcgcctgcg-tt 
gctgtg-c-tcc 
agatgaag-ct 
aggaggacaa 
cctgcgtcgg 
gcatcacgcc 
ccaagaacca 
gccgcacgtg 
ttgaaaag-aa 
ttgctcctga 
atttgg-gs.aa 
aacgccaagc 
acgagcctac 
tcctgg-tcgtt 
gtgtcttcga 
age g to; teat 
aggacaatag 
ctcagcatca 
ccctgactcg 
ccgtccagaa 
ttgttggtgg 
gcaaggaacc 
aggctg-gfcgt 
cccttacact 
cagtggtcjcc 
ttacaatcaa 
catttgatct 
ttgagat^ga 
aaaataagat 
tggttaatga 
ctagaaatga 
agctgggagg 
agattgaattg 
aggaactg-ga 
ccccaactgg 
gctgtaatat 
aagtctccjaa 
cggctgtfcta 



tcgcgagtgt 
cgtggcgcct 
tgtgctacgg 
ctccctggtg 
gaaggaggac 
cgtgttcaag 
gtcctatgtc 
gctcacctcc 
gaatgacccg 
aactaaacca 
agaaatttct 
gaaggttacc 
aaccaaagac 
ggcagctget 
tgacctgggt 
agttgtggcc 
ggaacacttc 
agctgtgcag 
agcaagaatt 
ggccaaattt 
agtgttggaa 
ctcgactcga 
atcccgtggc 
gctctctggt 
tggtattgaa 
taccaagaac 
ggtctatgaa 
gactggaatt 
tgtgaatggt 
cacaatcacc 
tgctgagaag 
gttggaaagc 
taaactttcc 
gctggaaagc 
agaaattgtt 
tgaagaggat 
tgtaaatact 
tgtaattgga 
ctgettttea 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 
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ttagcacjttg ctcacatgtc tttgggtggg gggggagaag aagaattggc catcttaaaa 2400 

agcgggtaaa aaacctgggt tagggtgtgt gttcaccttc aaaatgttct atttaacaac 2460 

tgggtcatgt gcatctggtg taggaagttt tttctaccat aagtgacacc aataaatgtt 2520 
tgttatttac actggtcaaa aaaaaaaaaa aaaa 2554 

<210> 274 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> X87949 
<400> 274 

aactttcctc tgaagataag gagaccatgg aaaaagctgt agaagaaaag attgaatggc 60 

<210> 275 

<211> IL359 

<212> OIXTA 

<213> Homo sapiens 

<300> 

<308> Contigl632 



<400> 2 75 

ttttaag;a.ca 

gtgaatttat 

tcatagcaca 

ttaccatatt 

attattttct 

agatgtag-gg 

aacaaatggt 

tgtgtagg-ca 

ttccatagag 

aaagttgcag 

atgaaaggat 

atggatgeitg 

gcatttcctt 

aaagataaag 

tctctacttc 

catgttcttt 

aagaacactt 

aatccacttt 

aggaaatata 

tacccgtttt 

tatatctaca 

ccaaaagtat 

ggtaagatta 



gttacctgtt 

caatcttttg 

atgtctttct 

ttgatatacc 

aaagaaacaa 

tacagtggaa 

gagctggaac 

acattgcctt 

ttctgtggtc 

atattggaat 

catttaatga 

cttaagcctg 

agcccttaga 

aaaaaacata 

aaccaaaatc 

ctaattggat 

ctgctttttg 

gaataatcca 

taaagctgtt 

ggtatatatt 

taaactgtaa 

gtaaaaagaa 

gcagtcaata 



gtgctgctgt 

atgtaaagta 

ataagatatt 

atttttttct 

tcatattttt 

cataagcagt 

atgccctgtc 

atcacgctag 

acaaaattgt 

gtatggaagt 

ctgttttact 

gtgaggtttg 

gtttcttgca 

tttaattact 

agatctttga 

ttatgaatag 

atccactgtt 

tgttttgtat 

atttattctg 

gcctctgcac 

ataatccttt 

acctgcatta 

aagttacttt 



tacaatatat 

aaaacgtagt 

tttaatgatt 

atctgcccag 

atacaaaatt 

gttacccctg 

tgtgctgtcc 

gttcacctga 

tttgctttta 

atctcagtct 

tataagtcat 

tactctaagg 

aacatttaaa 

gtaaacaggt 

ggttttgctg 

ttctatgggt 

tgcagcagaa 

ttggaaattg 

catttcttac 

atctacattt 

ctgtgaaagg 

ttttgtaatt 

tttgccttt 



aatgaaacca 

tcacacttca 

tagtatttta 

ttttattaaa 

atgttttcag 

gctg-ggagtc 

ctcc tgtgct 

cact ttaaaa 

tcaaatactt 

ctgcataaga 

taagtaatcc 

agcccagatc 

aaaagacata 

actgctttat 

acattgttgg 

tttcaaagat 

ttatatatat 

tttttzaaaaa 

atatctatcg 

gtatatgcaa 

atcatcatat 

atttcttata 

1359 



agtcagggga 60 

ggagagaact 12 0 

caacatttgt 180 

aaaactatat 240 

gtaacgaaat 3 00 

agtattatac 3 60 

gggtcgcgga 42 0 

ggaaaaaaag 480 

taatagaacc 540 

ggattaaagt 600 

accatttctt 660 

ataatgcagt 72 0 

tttaagaaag 7 80 

gtttattttc 840 

tggttttgca 900 

gaatcatgct 960 

g t a t aggaaa 102 0 

taaaaaggaa 1080 

cttgtcagta 1140 

cagtgagctt 12 00 

caagatgata 12 60 

gatatttcat 1320 



<210> 27 6 

<211> 60 

<212> DJXLA. 

<213> Homo sapiens 

<300> 

<308> Coxatigl632 
<400> 276 

gggttttc^a agatgaatca tgctaagaac acttctgctt tttgatccac tgtttgcagc 60 
<210> 277 
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<211> 994 
<212> DM 
<213> Homo sapiens 

<300> 

<308> Contig-3464 



<400> 277 

tgaatgtata 

actttca-citg 

tgtctgt tat 

gtacttt tta 

gttactg-taa 

tgacgtg-atc 

gaataagctt 

caaatggcgg 

aacttgt tct 

gcctttt caa 

tgaaagc ttt 

ctgactaaag 

ctttttttct 

agtcaagaca 

ctgcact ttt 

aactgtatca 

ggttcaataa 



tattaagact 
ttagacattt 
agctctttaa 
ttcaatgtaa 
tttttttttt 
tttaagactg 
ttatcaaata 
tcttatagtt 
ggcaaaccag 
ctcttaatgc 
tgtatgtatt 
ctgacctaaa 
ttggagggag 
aacacatctg 
tatgacattt 
catttatact 
aattggcaaa 



gtagctgaat 
atccttaagt 
ttcatcagta 
tcagaagctg 
tttttttacg 
ctatatatat 
gataattgat 
tcaattctga 
accctgctgt 
cctccacatg 
actaggtttt 
tggatcagtt 
ggtaacatta 
aaataaagga 
tgaaaatcta 
tgcagaaatt 
actgaacacc 



tgcacatgaa 
tgtgagcgat 
ttaatacaga 
tgatgttttg 
aagcacgtga 
cagtctctta 
gcaatttagg 
aaatagcaaa 
agatatagtc 
aatcagcagt 
gtttttctta 
tatgtgtaat 
tttggacaga 
actgtgtatt 
tttataggta 
tatttcattg 
aaaa 994 



atcag-attgc 
atatg-tagca 
attat cattt 
cctttgtagt 
ctgga-ctaat 
ctcta-taagg 
attcacgcaa 
cttaataaac 
taagg-tagtt 
taagaaggtt 
tgttt <jctga 
attctagtgc 
tgcagaagga 
aacatgttaa 
cagaacaatg 
ttattagtag 



caacttcttg 
tgctgtgaaa 
gcgtttcttg 
cctgtgcttt 
gtaaggcaga 
ttttaaatta 
gtttcagtgt 
agccacttta 
aaccatataa 
ctagaaccca 
ttttacagtt 
tttaatgact 
actgttagtg 
caattcataa 
ggttttgtta 
gaattttatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 



<210> 278 

<211> 6 0 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig3464 



<400> 278 

ctgctgtaga tatagtctaa ggtagttaac catataagcc ttttcaactc ttaatgccct 60 

<210> 279 
<211> 423 
<212> D3NTA 
<213> Homo sapiens 

<300> 

<308> Contigl4683 
<400> 279 

tatgttatgg atatcttatt ttagagtaag aatataaggc atagccatat ttatgaaggt 60 

agtaataotc tactaatcaa tacttagaag tttttgttat gactaatctg aatgcttttt 12 0 

agtttttcct taatctagtt atgttggtaa tttataagtc agttttcaga ttaggaaaga 180 

aggtatttga gggtgttcca tttccactga atagtaagat gatgcttact tagatttcca 240 

cagctgtttg aaagctctgt atttggctat aacggaaaac tttgttraggg atgcttgatg 3 00 

ttttgtgrttt tgtttctaaa ggaagacagt gttttgttcc ttctttagaa aacttgaaga 3 60 

atagaataat gagtccagga ttaatttggg ataaagtctt ttacttcata aattctgatt 42 0 



ctg 423 



<210> 2S0 

<211> 60 

<212> DIsTA 

<213> Homo sapiens 
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<300> 

<308> Contigl4683 
<400> 280 

aggaagacag tgttttgttc cttctttaga aaacttgaag aatagaataa tgagtccagg 60 

<210> 281 
<211> 391 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig28552 



<400> 281 

atgccattga 

gaaccacatt 

aaaaacgtgt 

agaaaaagcg 

tgctggsLggc 

caaaagatgt 

accatggtaa 



tgtgaagaag 
catgcaagca 
cagaatagct 
tttattagaa 
cttgcagtcc 
tcggcagagt 
taaaccgtat 



gtgtctgtgg 
ataaaggaga 
aaagaattag 
atgaagactg 
ggggctgcct 
ctcagtccaa 
ttataaattg 



aagactttct 
atatcaaaaa 
cagagcgaga 
agggtgatga 
tccgcgacag 
tgtctcagag 
c 391 



tactgacctg 
aagagaagca 
aagactcgaa 
gacaggagtg 
aagaaaaagg 
gcctgttctg 



aataacttca 60 

gaggaaaaag 12 0 

cgccaacaaa 180 

atggataatc 240 

acaccgatgc 3 00 

aaagtttgta 3 60 



<210> 282 

<211> SO 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig28552 
<400> 2 82 

aagactttct tactgacctg aataacttca gaaccacatt catgcaagca ataaaggaga 60 

<210> 2 83 

<211> 4=50 

<212> DISTA 

<213> Homo sapiens 

<300> 

<308> Contig28947 



<400> 2 83 

ctcatccaag 

tcatccctgc 

gtttttctta 

agtgata_ctg 

caatcttgtt 

ttccaactgg 

agaaagcjtgg 

gaagctctga 



gagctggggc 
cctctggtgc 
aacaggagac 
gagatggcta 
gggaaaaatg 
ttaatgatat 
gatggccttc 
aaataaaaga 



agacttcatt 
cagcctcctt 
agtctctccc 
gttgctgtgt 
tgggggagag 
tgtttgtaag 
agattatacc 
tcttgaaccc 



gattctagag 
accatcacgg 
ctcttacctc 
tacgggtttg 
gacttcttcc 
aaagagattc 
agcttagcta 
450 



acjacctgttt 
cttcactgag 
aacttcttgg 
ac/ttacattt 
tacacgcgca 
tgrttggttga 
gcattactaa 



cagtgcctac 60 

gt g t aggt gg 12 0 

Srgtgggaatc 180 

ggctataaaa 240 

ttgagacaga 3 00 

ctgcctaaag 3 60 

ccaactgatg 42 0 



<210> 2 84 

<211> 6 0 

<212> DISTA 

<213> Homo sapiens 

<300> 

<308> Contig28947 



<400> 2 84 
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agacagattc caactggtta atgatattgt ttgtaagaaa gacrattctgt tggttgactg 60 

<210> 285 

<211> 439 

< 212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig30875 
<400> 285 

agaaatcaat gacagttgac aggaagagag gacgcataca acaggcaaaa gaggaatgcc 60 
cagcagtctt ggtccttgcg gtgcaatact ggccttgagg ccaagtcagc aggggattcg 12 0 
tagtcactaa cttctaactg aggcagggaa gtaccatgtt ctcrgaaaagg tccaaagaaa 180 
caggaataga ggcagtgtag caagaggcag atttttggtg ccaaatagat ttgaatcctg 2 40 
gttctgcttc ttcctttgta gagtatgata ttggttcttt cctcccaaag ctattataaa 3 00 
gactaaatat gtacacaaat ctttgggatg tctgacatat aaatgcttaa caataggtat 3 60 
ttgctggtat tattacaaat gaatttgctt atttttgagc cacttctatg tctgtccatt 420 
aaaccaaaat gtgttctgc 43 9 

<210> 286 

<211> 60 

<212> DNA 

<2 13> Homo sapiens 

<300> 

<3 08> Contig30875 
<400> 286 

ggttctttcc tcccaaagct attataaaga ctaaatatgt acacaaatct ttgggatgtc 60 

<210> 287 
<211> 338 
<212> DNA 
<213> Homo sapiens 

<3 00> 

<3 08> Contig31221 
<4=00> 287 

gggaagttac actgcttcac accacaaggc cgtgggaaat ctt^gaggtt ctgtgccttt 
ctgtcacctc tactttttgc agctgtgatt gcactgtccc gcacatgtga ctacaagcat 
cactggcaag gaccctttaa atggtgaaaa tgggcagatg aatagcaata agtggacctt 
tgrttactctt ctgagttaga aaaattctaa tttagtacac tctgaacaaa gcttattata 
cttacttaag atgtgttttg atttggtgtt cagaaagcaa cctgacaatg ataatactgt 
aactatgata aaattgagaa taaaaagatt ttatttag 33 8 

<2 10> 288 
<2 11> 60 
<2 12> DNA 
<2 13> Homo sapiens 

<3 O0> 

<3 08> Contig31221 
<4O0> 288 

aaatgggcag atgaatagca ataagtggac ctttgttact cttctgagtt agaaaaattc 60 

<210> 289 
<211> 417 



60 

120 

180 

240 

300 
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<212> DNA 

<213> Homo sapiens 

<30O> 

<30 8> Contig31288 
<40O> 289 

gaatcacttg agcccgggag gttgaggctg cagtgagcto; tgtttatacc actgcactcc 60 

agcctgctgg gtaacagagc aagactccat ctcaaaaaga aaagaaaaaa tgctttgcta 12 0 

cataatgagg ccaggcaaaa aaaaaaaaag tcctgtggaa atcatataga caaacatttg 180 

caaagctgct actgccattg taccagtgtt aaaatgtgtt ctaccttgca tcttttactg 240 

atttttatga cagattttat attgtaacca tttgagaact ctgtaagtgc tatggcttcc 3 00 
ttaaactacg atttatcata tgctcccagt gtttactttg agactgaatg gcaaccagag 360 
aatgtaaaca accaaggtgc atctggttat gttttaaaat aaagattaat aaaagtt 417 

<210> 290 

<211> 60 

<212> DNA 

<2 13 > Homo sapiens 

<30O> 

<308> Contig31288 
<40O> 290 

ggcttcctta aactacgatt tatcatatgc tcccagtgtt tactttgaga ctgaatggca 60 

<210> 291 
<2in_> 394 
<212> DNA 
<213> Homo sapiens 

<30O> 

<308> Contig31646 
<40O> 291 

gctgctacac cccatgtaaa aagcggaaaa taaaatgaag attttccagc gcaagatgcg 60 

gtactggttg cttccacctt ttttggcaat tgtttatttc tgcaccattg tccaaggtca 12 0 

agtggctcca cccacaaggt taagatataa tgtaatatct catgacagta tacagatttc 180 

atggaaggct ccaagaggga aatttggtgg ttacaaactt cttgtgactc caacttcagg 240 

tggaaaaact aaccagctga atctgcagaa cactgcaact aaagcaatta ttcaaggcct 3 00 

tatgccagac cagaattaca cagttcaaat tattgcatac aataaagata aagaaagcaa 3 60 
gccagctcaa ggccaattca gaattaaaga ttta 394 

<210> 292 

<211> 60 

<212> DNA 

<213:> Homo sapiens 

<300> 

<308> Contig31646 
<400> 292 

gccagaccag aattacacag ttcaaattat tgcatacaat aaagataaag aaagcaagcc 60 

<210> 293 
<211=> 357 
<212> DNA 
<213> Homo sapiens 

<300> 
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<308> Contig37562 
<4O0> 293 

caattatttc aagtgcacct tattaacaaa agtatcagtg gatccaacat aaaattttat 60 
agtactaaat gtcaagccta actgtgaatt ttgttctgta tcttaagtaa atttatgata 12 0 
atgttctcga gctatcaaca aaatatatgt acttttgtga gctatgaatt ttctaattaa 180 
attttacatg ctataacatg atttttacat gaatgatact ttgtttataa ctatcaaatg 240 
tcagtatttt actacaattt tattataaag tgtacattat cactaaatga acttcgattt 3 00 
taaaaatcaa attagcttta gttgtatatt attttttaca aataaagata gacttgt 357 

<210> 294 

<23_1> 60 

<21.2> DNA 

<21_3> Homo sapiens 

<3O0> 

<3 0 8> Contig37562 
<4O0> 294 

atcaaatgtc agtattttac tacaatttta ttataaagtg taoattatca ctaaatgaac 60 

<21 0> 295 
<211> 351 
<212> DNA 
<213> Homo sapiens 

<30O> 

<308> Contig37895 
<40O> 295 

aatagagaca cctctaatta attaaagcgg atgccctccc cactcctccc aggatttgac 60 

tcggagcaca aactcttcac aaaccaaaat gtcaggacac catcgccagt gtccactggc 12 0 

cactgctgtt ggtgtgaggc agccaggagc ccctcagaac tagtaagtct gagaagaggc 180 

tgcacggggc ctaggagagg gagaaatgag cccgtccaag gtg^attcct tgattctcca 240 

ttgtgagtgc accaagaaca agcactccct ccgactgact ctcgcctacc aggatctgga 3 00 

acaccttcca ttaatttatt cgttcattca ataaatattt attgactgac t 3 51 

<210> 296 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<30O> 

<308> Contig37895 
<40O> 296 

ctctcgccta ccaggatctg gaacaccttc cattaattta ttcgttcatt caataaatat 60 

<210> 297 
<213_> 418 
<212> DNA 
<213 > Homo sapiens 

<300> 

<308> Contig38288 
<400> 297 

gacaagtaaa tgggggccgt tgggacggcg ggtgcctgga gggcagctct gggctcagcg 60 
ggcacjtgctt agagcacagg cccctctgtt gggggatggg gacjgoigagca gtctgccctt 120 
gggacgcgtag gccccaggga gacttctaaa gccccccctg tcgtctgctc ttcacccagc 180 
accacagagg cacctgctgc acacacaagc atctcactcg gcccacggag ggggccaggc 240 
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ttcctttgcc tgaagctgtt ttgggaaggg tctccacaca ggcactgatc tcccaagctt 3 00 
tg-gtcatgat gtcttttacc atttgataat tttaaacatt gtttttaaac ccaaaacatt 3 60 
tagtggtccg ttgcctctga agatgtaaac aaacaaatac actatttctg ggaacatt 418 

<210> 298 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig38288 



<400> 298 

tttagtggtc cgttgcctct gaagatgtaa acaaacaaat acactatttc tgggaacatt 60 

<210> 299 
<211> 413 
<212> DNA 
<213> Homo sapiens 



<300> 

<3 08> Contig38901 



<400> 299 

tacatttttg tttaatgttg ggcctgaggt taactgtgac catggtccag cttgagtggc 60 

ttctggagca gccacatttt caaggactgt ccaaaagcca gccagttcag ggctcaggcc 12 0 

tcacccattg cccactcctg gggagaccat cacctggctc atcgtttcca ccaagagtgc 180 

cocacaggag tgccccacag acccgctgga ccagcctgct gcgggtcctg gccaggggtc 2 40 

tggctaacgg tgagggctga ctctgaactg tctctcagtc tccagaaagt gttcaagcct 3 00 

gttgtgttcc caaatctgat tcctcctatt gtcttgtaaa tcaaactcta agtgaaaact 3 60 

tcccatttgt cccttcaaag attttttttt attaaatggt tttttaagat cct 413 

<210> 300 

<2 11> 60 

<2 12> DNA 

<2 13 > Homo sapiens 

<3 00> 

<3 08> Contig38901 
<4 00> 300 

tgttcccaaa tctgattcct cctattgtct tgtaaatcaa actctaagtg aaaacttccc 60 

<220> 301 
<2 11> 434 
<212> DNA 
<2 13> Homo sapiens 



<3 00> 

<3 08> Contig40434 



<400> 301 

ga-atggtgaa agagagatgc cgtgttttga 
aagaaacatt cagaaacata ggaattaaaa 
cacaaacttt acactttaat ctgatgattg 
ttagctaact ttaaaaaatg gatgtagaat 
gtatcagttt gaacctagaa tattgaatta 
aaggaatgga ggaaaattgc atcttagacc 
ctagaaatga gataaagatt atttattttt 
at-fcttatgaa atcc 434 



aagtaagatg atgaaatgaa tttttaattc 60 

cttagagaaa tgatctaatt tccctgttca 12 0 

gatattttat tttagtgaaa catcatcttg 180 

gattaaaggt tggtatgatt tttttttaat 240 

aaatgctgtc tcagtatttt aaaagcaaaa 3 00 

atttttatat g-cagtgtaca atttgctggg 3 60 

gttcatatct tgtacttttc tattaaaatc 42 0 
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<210> 302 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<3 08> Contig40434 
<400> 302 

aaggaatgga ggaaaattgc atcttagacc atttttatat gcagtgtaca atttgctggg 60 

<210> 303 
<211> 391 
<2 12> DNA 
<2 13> Homo sapiens 

<3 O0> 

<3Q8> Contig40552 



<400> 303 

caccaagccc 

ttactgtgcc 

cttgcagctc 

gctggctctg 

gttttgatgc 

ggaLtggtcag 

gattttttgt 



tgctccggca 
cccagaccaa 
cagagccccc 
agtgtccctt 
cctcactagt 
ccctacccac 
tcaataaaga 



cctcgaatcc 
gagctccatc 
tggcccagcc 
ccctagaaag 
gttgaagcct 
ctgccctgtt 
atttggtaaa 



ctggcgacca 
ccccgccctc 
cgtgactcct 
gcctgaccct 
gttccagaga. 
tgagcttcct 
a 391 



tgagtcacca 
tgcagcgttt 
ctttggggcc 
ccttacccac 
gaggtgggac 
gtttgacaat 



gctccaagcc 60 

gtctagcgcc 12 0 

tacagatgaa 180 

cagaacaggg 240 

t gcaaggaga 3 00 

gtttgctgtt 3 60 



<210> 304 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<3O0> 

<308> Contig40552 
<4O0> 304 

tttgagcttc ctgtttgaca atgtttgctg ttgatttttt gttcaataaa gaatttggta 

<210> 305 
<211> 495 
<212> DNA 
<213> Homo sapiens 



60 



<30 0> 

<3 0 8> Contig41413 



<40O> 305 

aaatattctt 

gagacattgc 

tgteittcagt 

agacaatttt 

taaagttaat 

tcttgcattg 

agctgaataa 

ggaaacaagt 

aactgttctt 

<210> 306 
<211> 60 
<212> DNA 



aatagggcta 
atgaaagatg 
atttgaacgt 
taaatctctg 
ccagattata 
tgaggtacag 
tgtgaggatt 
atttcagtct 
atgtc 495 



ctttgaatta 
atgagagata 
cgtcctgttt 
taatatgata 
tggtccttat 
gcggaagttg 
aacttctgcc 
caaaattgaa 



atctgccttt 
aatgttgatc 
gttgttagtt 
cattttccta 
atgtgtacaa 
gaatcaggtt 
agctcagacc 
taatgcacaa 



atgtttggga 
ttttggcccc 
ttcttcatca 
tcttttaagt 
cattaaaatg 
ttaggattct 
atttcctaat 
gtcttaagtg 



gaagaaagct 60 

atttgttaat 120 

tttattgtat 180 

tattgttacc 240 

aaaggctttg 3 00 

gtctctcatt 360 

cagt t gaaag 42 0 

attaaaataa 480 
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<213> Homo sapiens 
<300> 

<308> Contig41413 
<400> 306 

cagctcagac catttcctaa tcagttgaaa gggaaacaag tatttcagtc tcaaaattga 6 0 

<210> 307 
<211> 409 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig41538 



<400> 307 

aaaaaaaaaa 

tacatagtcc 

gatgctgatt 

ctgccaaggg 

catccagact 

cacttatagc 

gtattcctga 



aaaaaaaaaa 
gaaggtctta 
agttctcagt 
tcataaatgg 
tctggaactc 
ttcaggcatg 
tagtaatgct 



aaagagttgt 
caactaatca 
tttcattcag 
tgactgccta 
aaagattaac 
tatttatatg 
acaataaatc 



tttctcat cjt 
ctggtagcaa 
ttcacaatat 
acaacaaaat 
ttttgactsia 
tattcttgat 
caaacatttc 



tcattatagt 
taaatgcttc 
aaccaccatt 
ttgcagtctc 
ccctggaata 
agcaatacca 
aactctgtt 



tcattacagt 60 

aggcccacat 12 0 

cctgccctcc 180 

atctcatttt 240 

tctcttatct 300 

taatcaatgt 3 60 
409 



<210> 308 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig41538 
<400> 308 

ctcatgttca ttatagttca ttacagttac atagtccgaa ggtcttacaa ctaatcactg 60 

<210> 309 
<211> 552 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> Contig41887 



<400> 309 

ctgaagacta cgaccatgaa atcacagggc tgcgggtgtc tgtaggtctt ctcctggtga 60 

aaagtgtcca ggtgaaactt ggagactcct gggacgtgaa actgggagcc ttaggtggga 12 0 

atacccagga agtcaccctg cagccaggcg aatacatcac aaaagtcttt gtcgccttcc 180 

aagctttcct ccggggtatg gtcatgtaca ccagcaagga ccgctatttc tattttggga 2 40 

agcttgatgg ccagatctcc tctgcctacc ccagccaaga ggggcaggtg ctggtgggca 3 00 

tctatggcca gtatcaactc cttggcatca agagcattg-g ctttgaatgg aattatccac 3 60 

tagaggagcc gaccactgag ccaccagtta atctcacata ctcagcaaac tcacccgtgg 42 0 

gtcgctaggg tggggtatgg ggccatccga gctgaggcca tctgtgtggt ggtggctgat 480 

ggtactggag taactgagtc gggacgctga atctgaatcc accaataaat aaagcttctg 540 
cagaatcagt gc 552 



<210> 310 

<211> 60 

<212> DNA 

<213> Homo sapiens 
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<300> 

<308> Contig41887 
<400> 310 

tactggagta actgagtcgg gacgctgaat ctgaatccac caataaataa agcttctgca 60 

<210> 311 
<211> 745 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig42342 
<400> 311 

gcagtaaaga caggacgcac ccatgtcaca agaggagcac aggcaggggt gttggtgttg 60 

gggcagccct cagggtctcc agacccagcc ccactcacac agcagcctag gaaggaaggg 12 0 

cagagtccca ggtgtcagct ggtgggtctc ccaggagctg cccctccctg gaagtcacag 180 

gacaggaatg acagatcagg gaactgcagg aagctgccac ctctggggtc agaatatgcc 240 

cagcctgcgg gggctctcta tcggggtctt cgagagccag acagcctgcc ttgtgctgca 3 00 

tacctggctt tgctctgtgc agaacccagc acacgtgatt ttgtgtgaca tgccagcagc 3 60 

ctggctccca ggacaggagg cctgccctgg gggagg-g-gct gcaggaggag ggggggcagg 42 0 

cacccatgag tctgtccagc cttgtcacag atgcatcgcc caagctgcgg tcctgatttc 480 

agctcacctc agagtaaatc agaataaact gcacccagac tttcacgaat gcatgttgac 540 

gctttcagtt cacccctttc tttgctaact ttcttcctat tttcttctaa tgcgagagct 600 

tattaattcc atatttatca ttttgaataa cttttctcct ttttagtaac aaaatgtact 660 

tcactcttag taaaatgtat ttactatttt agtaacaaaa atatacttgc ctaatcatgt 720 
ttaaaatata gtgatgtgaa aaatt 745 

<210> 312 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig42342 
<400> 312 

cacccagact ttcacgaatg catgttgacg ctttcagttc acccctttct ttgctaactt 60 

<210> 313 
<211> 398 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig43645 
<400> 313 

agttcaaagg cagataaatc tgtaaattat 
attactccaa aataattaaa tttaaggctt 
taataaagtt tcatgttaat gtcataggat 
aatatgtgta tattaaaatg taattgattt 
gcattagggt tctttgcaat gtggtatcta 
attttgaaaa gcttatactg gcagcctaga 
ctggcacatg aataaacttt gctgtggttt 

<210> 314 
<211> 60 
<212> DNA 
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tttatcctat ctaccatttc ttaagaagac 60 

tatcaggtct gcatatagaa tcttaaattc 12 0 

ttttaaaaga gctataggta atttctgtat 180 

cagttgaaag tattttaaag ctgataaata 240 

gctgtattat tggttttatt tactttaaac 3 00 

aaaacaaaca attaatgtat ctttatgtcc 3 60 
actaatct 398 
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<213> Homo sapiens 
<300> 

<308> Contig43645 
<400> 314 

gaaaagctta tactggcagc ctagaaaaac aaacaattaa tgtatcttta tgtccctggc 60 

<210> 315 
<211> 478 
<212> DNA 
<213> Homo sapiens 

<300> 

<3 08> Contig44289 
<400> 315 

ctaaaaacaa cactcatcag tcttgggaaa tttga_acttt gatcaactta actaaagaag 60 

gaagggtagt aagaattttt caaatacaaa tatttgccaa ttcacagatg ataacattta 12 0 

aggccttcaa aagtaagggt ttttccttgt ttctccagtc agcttttgtc aactctaata 180 

gttttttcat aaacattttt tatttgtata attgcaacag tttaagaaat tatcacaact 240 

atttagaaac atttaaaatg ttctttttga tataagctat atacttggaa aaatacattg 3 00 

gtatctaaaa tttgaggtgt gttaagactg ctttfctgttt taaaaaatgg tttacattca 3 60 

aatttttgaa gtgttttatg cttcatatgg ctaacj-ttgta gtttggcaga gttaacagca 42 0 

taagaataaa catgctgtaa ttttaaaaga tgctttgaat aaaaatttat tttaattt 47 8 

<210> 316 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig44289 



<400> 316 

catcagtctt gggaaatttg aactttgatc aacttaacta aagaaggaag ggtagtaaga 60 

<210> 317 
<211> 556 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig44909 
<400> 317 

accatctggg atttctacag cctgggtacc catagccaca ccaaggcttc tgggagattc 60 

tgcagggtca gctttccagg ctgttcccaa atagctccct gcctccccac tgcccctaaa 12 0 

gccacagcag aagagccatt catctcataa acaaaaagga agaggaaaga atgaggaagg 180 

accctgtgca aggttatttg caggcaggga tgggc ttgta cctgacagca cccacccctg 240 

tgtggccccc aggccctcat caccctcaga cccctcctaa gcagttccct cattgctctt 300 

tggactaggc tgacagcagg aagagcaggg cccatgaccg ggtggaagfct cagttttggt 3 60 

gtctgcttca agagggggtt ttacactctg attccaggac aagcactctg aggcgggtgg 42 0 

gggagagaaa ccctggctct tcacccaggt ttcacacaca tgtaaatgaa acactatgtt 480 

agtatctaac acactcctgg atacagaaca caagtcttgg cacatatgtg atggaaataa 540 
agtgttttgc aatctt 556 

<210> 318 
<211> 60 
<212> DNA 
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<213> Homo sapiens 



<300> 

<308> Contig44909 
<400> 318 

tcacccaggt ttcacacaca tgtaaatgaa acactatgtt agtatctaac acactcctgg 60 

<210> 319 
<211> 710 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> Contig45032 



<400> 319 

aaagataggc 

ggcacctgtc 

tggcctaaca 

tatgtgagac 

tatcatactg 

atgtggactg 

aaacagatac 

ttagtctgta 

ataattatat 

gttcaatgtg 

tagtcagtaa 

atttattttg 



ttctaagtta 
atatgccaag 
gacattaaac 
attctctttt 
ttttgtttta 
ggaaaatctg 
ctcctatttt 
gcacaaaaat 
actccttaat 
cttttaatgt 
tttatcttaa 
tttattgcac 



aggcaaatca 
tgatattcaa 
aaatgtctgt 
tataataatt 
tatttttcct 
cagcatcaga 
gaagccagga 
tttcctttta 
actagattga 
acgttacagc 
cctcaaaact 
tttatgaaaa 



ttcattctgt 
aatggcccat 
gaaactgaca 
cctgtaaagc 
aagagctaaa 
ctatgccttt 
atactgctca 
gtctagcctc 
tctaaggggg 
tttcacagag 
gtatataaat 
ctgaagcatc 



cattaaacaa 
gtagaccttt 
taataaagta 
agtacttact 
acgtcatcct 
catcccccag 
gcacctgacc 
gtgttataga 
aaagatcatt 
ttaaaaggct 
cttcaaagct 
aataaaatta 



atacaaacca 
gtgaagtatg 
aggtaagtta 
taggtaatga 
ctcttcagtg 
ccagatgatg 
gtatctggat 
atgaacttgc 
atttaaccta 
gaaaggaata 
tttttcatct 
710 



<210> 320 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig45032 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 



<400> 320 



ttaacctagt tcaatgtgct tttaatgtac gttacagctt tcacagagtt aaaaggctga 60 

<210> 321 
<211> 726 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> Contig46218 



<400> 321 

atacatattg ctttagagag caggtaggtg 
aataccatct ttctaagcca ctggaatttt 
caggtcatca acctcaagtc tttacatatc 
ggcccaagtt tctccttaat tgggatctat 
attgaataca tcagagctat atatgtaagg 
ttgaattcat gttaagtaga ccctctaggg 
aatgatgcaa taagaatgta tgcactctct 
ggatgtaaac agaattgcag tgctgccgaa 
ccccttgtaa ttatgactct gagataaaat 
ttttttattt tatgaataaa cttgggattg 



gccatgtgtt cagcagtgtg tccttaagaa 60 

tactttacta tttttaacat taatggatgt 12 0 

catgtatatt ccatatatat tgtttatata 180 

atactaccag cacaacatca aaaacatgta 240 

aaatgactgg tgaccccatt atcatcattg 3 00 

gaccataagg caattgagca cataacgaaa 3 60 

ttgccaaatg catgtgcttt tgtgtaacgt 420 

attcttgatc ttggctaaga gagtattttt 480 

tgccattttg aaatttccaa agtaacaact 540 

caatttctct gatctgacaa tcaataactt 600 
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taacaaagat ctaaataagt gtttcaagga aagttttcct aagcaaatgt aatattacct 660 
catttgggca tcattactct gttaattcta tatcaaagga aataaacttg ctacttgcac 72 0 
taaatg 72 6 

<210> 322 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig46218 
<400> 322 

accataaggc aattgagcac ataacgaaaa atgatgcaat aagaatgtat gcactctctt 60 

<210> 323 
<211> 580 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig47096 
<400> 323 

ggtggtctct catccttgtg tgctgctctg ctaagagatg tccaaggcgg agccggggca 60 

agatccttcc agactcatct gtcagagccc caagcccttt agacccagag cccaaggacc 120 

atgcctttgg gacattagga ctgcagcctt tgcttctgtg tattttggag ttttggtgac 180 

ttttgtcacc tggacacact catttgttag ccatagtggg ttcccttggt cagcaacagt 24 0 

gcatgtacct ctggatgtca tctgaggtga gaccaccgag gccttttctc tctgtgtaca 3 00 

gaggggagtt aggagttgct ggactggcatg cattacgagg actggggaca gggtagaggg 360 

acatccaggg atcagggcat gagtgggg-gc aaccccccgg cctctgccct ggcatggtct 420 

ccgcatgggc tgaggtgtag ctgattgg-ct gccacatttc ggccatgctg gctggcgtgc 480 

ccatgttgca gatattttcc cgagttcccc agaatggatg gtattgaatc tcagccacat 540 
gcaacactgt gtccagcatt ctttgcaata aatacttttt 580 

<210> 324 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig47096 
<400> 324 

atattttccc gagttcccca gaatggatgg tattgaatct cagccacatg caacactgtg 60 

<210> 325 
<211> 632 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig47563 
<400> 325 

gccatctagt ctgtggtttt ctgttgaa^c agtctgaatt gactaaaaca gtcacttgga 60 

gtagttataa accactttcc tgttgaaa^c agaacatgct gattcaactg ttttgttcaa 12 0 

tagcaatgat agattttgtt taagtcccct acactttctt atttctaaat gatcaagagt 180 

acacttcctg gcagtgatta aggagtgtg-t atctaacaga aaaaatatat ataccctgtg 240 

aacccgaata tggaattcag attgtttctg ccctcagtat catacttaaa aaacaagcat 3 00 

acaaacaaac ataagggaac aaacagcaac cataacaaaa acaaaccttt aaaggtgggt 3 60 
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ttttgctgtg ataaatgaat acggtactct 
aaactgcaag tgatttaaaa attagagaat 
cagaaaacct caagtgaatt ttgtatgtaa 
tttaagcaaa caattagctg aaaacttggt 
gcacccatca gaaaataaag ggcattaaat 



gaaggagaaa aaagtttctc aaatgagctt 4 20 

ataattctta aagctattga aagtttcaac 4 80 

atgaaatctt gaatgtaagt tctgtgattc 5 40 

attgttgtag tttatgtagt aagtgacttg 6 00 
tg 632 



<210> 326 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig47563 



<400> 326 

agcaaacaat tagctgaaaa cttggfcattg ttgtagttta tgtagtaagt gacttggcac S 0 

<210> 327 
<211> 540 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig48913 



<400> 327 

accagagggt gtcccttttc cacagtaatg ggatcggctg gtgtgccttc agggaggaag 6 0 

agggaggtgg tcaagcttga aaaactggct ttaggatggt tctgactttg ttctccctcc 120 

ccaagtgttc tcaacctcca ttctgcagtg ttcagagttt tagggaaagg gtttgggtgc 180 

cccagcatcc aggtgttgtg tggcttagcg catgtgaagt gaaaaccttc tggggttgtt 2 40 

tggaagcagc tttctggttc ttgtgattgt atcctgaggt cccagaaccc tattctccca 3 00 

cgaggatcct cagtgaccat ggtggccaca cgcctggcca gcctgctggc tcctgggtga 3 60 

gctgaagaac cttgcctgtg gcacttttcg agggtgagct ggaaccgaga gaacatggtc 4 20 

cccgtgctgg gactcatgcg ggtcatttcc tgccggcctg gtttcgcctg gtcgtgtctt 4 80 

tatgagcacc atgtaagcct ccttgta.ttg agataattgg gcattaaaca ttaaactgca 5 40 

<210> 328 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig48913 
<400> 328 

tatgagcacc atgtaagcct ccttgtetttg agataattgg gcattaaaca ttaaactgca 6 0 

<210> 329 
<211> 534 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig49169 



<400> 329 

cctaatgtta acatttttaa aaatacatat 
tgttaattta caattcatgt ttcaagacat 
agggggtcac tgggtctcat agactg-a.tat 
caaactaatg gaaggcagac tattta_cagc 
cagagaaatg cctcccgtag tgtttg-ciaag 



ttgggactct tattatcaag gttctaccta 6 0 

ttgccaaatg tattaccgat gcctctgaaa 12 0 

gaagtcgaca tatttatagt gcttagagac ISO 

ttagtatatg tgtacttaag tctatgtgaa 2 40 

cgttaagctg ataatgtaat taacaactgc 3O0 
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tgagagatca aagattcaac ttgccataca cctcaaattc ggagaaacag ttaatttgg-g- 3 60 

caaatctaca gttctgtttt tgctactcta ttgtcattcc tgtttaatac tcactgtact 42 0 

tgtatttgag acaaataggt gatactgaat tttatactgt tttctacttt tccattaaaa 480 
cattggcacc tcaatgataa agaaatttaa ggtataaaat taaatgtaaa aatt 53 4 



<210> 330 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig49169 



<400> 330 

catacacctc aaattcggag aaacagttaa tttgggcaaa tctacagttc tgtttttgct 60 

<210> 331 
<211> 602 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig49388 
<400> 331 

tgtcagtgga ggggtctctg cagccaactg agactatctt gctgtgccct gagccttcct 60 

agggtttaga agaacagcat tcaaaattcc ccgtcctgtc agtgtttgcc ttcgcacctc 12 0 

ctcccctaaa gcagcgcggg gggcaaataa gaccccaccc ctccctgcag cttcacagg-g 180 

acgcttcctt ccctccccgc aaccacccca ggctcccctg ggaggctgca gttgtggtac 240 

acgtccccgg tgctgggttg gccgtgactc gggggcgggg cgatcgggtc tcagcccctg 3 00 

ccttccccag tctctgggtc acccgaattt tcccacccct gcttctcccc gaggaggttg 3 60 

agctcttgag caagttggga cttg-g-gccgg ggcctggaag aatgattggc tgggaggccg 42 0 

cgggagggag gccaggaggc ccggaccagt tgggaggagt gagcaggccc cgggggaggg 480 

ggatgagcgc agtttgctcg ctttcctccc ctgccggccc cctccgcccc cacacacact 540 

cgggacgtct tcattgaaga ttcacttaca aaggaatgtt tcactaaata aaagaaaacc 600 
ag 602 



<210> 332 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig49388 
<400> 332 ' 

cgggacgtct tcattgaaga ttcacttaca aaggaatgtt tcactaaata aaagaaaacc 60 

<210> 333 
<211> 562 
<212> DNA 
<213> Homo sapiens 



<300> 

<308> Contig50728 



<400> 333 

gcgaatttgg gccccttgat cctctgatgg 
atttagggag gctgttcagg ctttcjcaggt 
ctgtgactgt gtccatgtgt gtgtg-tttgt 



gagctgaaag gatgagaggt gggcatctag 60 
cccttacctg aacacataga aaccctggag 12 0 
ctgtgtgtgt tgcgggggat gggcacctgc 180 
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atgaatgtgg tagagaaaat ggctctgctc agagggaaga tacgcatagc aaggcaggga 240 

ccagaggaat cacaggcgcc tggagagcag ccgggcaccg cctccaggga cctgccggct 3 00 

tccctcagtc ctccaggggc ccagcactct tcctttaggc cctgtgagcg tcccttgtca 360 

ggatacattc tctcattttg ctgaagctga tttgattggg tgtctgtttc tcgcagccaa 42 0 

aagagctctg aatgaggaaa gtgcttctgt gctaactccc cgcgtctcct gaatttca^t 480 

cattcatgta cccgcctcga aatttttgca atatctgtgt accaactgtc catttactta 540 
ataaagaagt tttctttaaa tt 5 62 

<210> 334 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig50728 
<400> 334 

tttgattggg tgtctgtttc tcgcagccaa aagagctctg aatgaggaaa gtgcttctgt 60 

<210> 335 
<211> 400 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> AI497657 
<400> 335 

tttttttttt tgcacttatg gtatttattg ttggaagatt gagtacctta atgcacacca 6 0 
atgctcagat gacttgggcjg cacatagggg actgctgtca ccatgcctca ctcctgcacjg 12 0 
gaaggggctg ccctactaaa accccagcgg gcccagtgct gtgtccagaa caggtcctta 180 
tattactgca gcccacaatg gaactactga gtaggagcca aaagaggagg gagcaggaag 240 
aggtggcatt tggagagggj-g- agaccgcacc cacaggtctg ccacagcgcg tcaacggtcat 3 00 
ggggtacttt tacagtcaa_g- ttgacttcgg tgtccgccca ccatctacct ttgtaggaoc 3 60 
actgaaacaa gggacatcca ccacggccca cagccggggc 400 

<210> 336 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> AI497657 
<400> 336 

gagcattggt gtgcattaaQ- gtactcaatc ttccaacaat aaataccata agtgcaaaaa 60 

<210> 337 
<211> 475 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig50950 
<400> 337 

ctggaagagg ctcccaaccc agagtgtccc tgtgggaggc aggcagaagg tgacaattg-a 60 

cacgatttcc tgcacgcgtc ctcctctacc ttggaagcag ttagaatcta ccaggcacag 12 0 

atgaggccgc ccttgcctga cggagcttga tgagcagccc ttggtctccg gttccaggac 180 

tgagagccca gctgcctctg- cccacccttc cccaggcctc tgccagcctc tggctgcac<g 240 

gtcaggccct gccccatggc aggcctgcca gagcttggct ggggacccct cccgcctctg 3 00 
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gctccctgat gggctg-gatg taacttgtgt cttctagccc cttaaggagc ccaggtgttt 360 
taaggaatga attggtcact gcatcttgta tcgattatgg ttctgagaaa agcaaatatc 42 0 
acttttggct gcattaaaag aagcatcata tataaaataa agaagatgaa gg-tct 47 5 

<210> 338 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig5095O 
<400> 338 

gtcactgcat cttgtatcga ttatggttct gagaaaagca aatatcactt ttggctgcat 6 0 

<210> 339 
<211> 860 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig51660 
<400> 339 

ggatggcaac cttcagctag actgcctggc tcaagggtgg aagcaatacc aacagagagc 60 
atttggctgg ttccggtgtt cctcctgcca gcgaagttgg gcttccgcca agtgcagatt 12 0 
ctgtgccaca cgtactggga gcactggaca tcccagggtc aggtgcgtat gadget cttt 180 
ggccaaaggt gecagaagtg ctcctggtcc caatatgaga tgcctgagtt ctcctcggat 240 
agcaccatga ggattctgag caacctggtg cagcatatac tgaagaaata ctatggaaat 3 00 
ggcatgagga agtctccaga aatgccagta atcctggaag tgtccctgga aggatcccat 3 60 
gacacagcca attgtgaiggc atgcactttg ggcatatgtg gacagggctt aaaaagctac 42 0 
atgacaaagc cgtccaaatc cctactcccc cacctaaaga ctgggaattc ctcacctgga 480 
attggtgctg tgtacctcgc aaaccaagcc aagaaccagt cagatgaggc aaaagaggct 540 
aaggggagtg ggtatgeigaa attagggece agtcgagacc cagatccact gaacatctgt 6 00 
gtctttattt tgctgcttgt atttattgta gtcaaatget ttacatcaga atgatgaaaa 660 
taggcttgee actttctctt attttaattc catggtagtc aatgaactgg ctgccacttt 72 0 
aatataactg aaaattcatt ttgagaccaa gcaggatcaa gtttgtagaa taaacactgg 7 80 
tttcctagcc atcctctgaa aacagtatga aacatgacca agtacataat ggatttagta 840 
ataaatattg tegaattget 860 

<210> 340 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig51660 
<400> 340 

gctgcttgta tttattg-tag teaaatgett tacatcagaa tgatgaaaat aggcttgeca 6 0 

<210> 341 
<211> 608 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig52490 
<400> 341 

atcgtggcta geggacagae acgagcctct tgggaatacc ttgtccatca cgfccatggcc 60 
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atgggtgcct tcttctccgg catcttttgg 
ctactggtgg aagtcagcaa catcttcctc 
gcccaggatc atctcctcta ccgggttaac 
ttccgcctgg cccctcaggc ctacctcacc 
accctgggca ccttcctgct gggtatcctg 
ttttcccgcc tcctccgctc tgacttctgc 
gacaagttct tgactgagaa ctgagtgagg 
aaggccagaa acagcttcat atggacactg 
aggccagcct ctccaccttc tgagcctgcg 
ctcctctg 608 



agcagctttg tcggtg-gggg tgtcttaaca 12 0 

accattcgca tgatga_tgaa aatcagtaat 180 

aagtatgtga acctgg-tcat gtactttctc 240 

catttcttct tgcgttatgt gaaccagagg 3 00 

ctcatgctgg acgtgatgat cataatctac 3 60 

cctgagcatg tccccaagaa gcaacacaaa 42 0 

ggcacagagc ctgggacaac aaaaacggac 480 

ggacttagcc ccaagcctgg gtgtcctctg 540 

cccacactat tgaaaacact aatgaaagta 600 



<2 10> 342 

<2 11> 60 

<2 12> DNA 

<2 13> Homo sapiens 

<3 O0> 

<3 08> Contig52490 



<4O0> 342 

ccaggatcat ctcctctacc gggttaacaa gtatgtgaac ctggtcatgt actttctctt 60 

<2 10> 343 

<211> 1282 

<212> DNA 

<213> Homo sapiens 

<3O0> 

<3 08> Contig53598 
<4O0> 343 

catgccagca cctttgaacc ggtctcttag aagaagacac acatcctggg tgtacagtgg 60 

tgaaatgggg agtgggtgcc cattctgaaa aacgaggcat tcctgctcat tccctctgct 12 0 

tacrctggtgg gcaggggaga gagggaaatg ccaaaaactt ggagtgaagg atgatgctat 180 

tttttatttt taaatatatc ttcaggttat tttcttactg ttgcttcaga tctaatgtaa 240 

aaggcagatg tcccctcctc tccacccccg acgctgaccc cggcctcagt cacggctctt 3 00 

tgcatgatca cagttctgtg ttctggcctg tggcagggcc gggaaggrgcc gctggcttcc 3 60 

gaacagacgt ggttgctctc cacgaggcgc atggggagcc cgcgggccct aagctttgtc 42 0 

gcagatgtca tcattggcag aattacttgt cttgaaaaat aagtagcatt gctgaaacac 480 

acaaccgaat tctctacgat ggccatttgc tcattgtctt tcctctg-tgt gtagtgagtg 540 

accctggcag tgtttgcctg ctcagagtgg cccctcagaa caacagg-gct ggccttggaa 600 

aaaccccaaa acaggactgt ggtgacaact ctggtcaggt gtgatttgac atgagggccg 660 

gacrcrcggttg ctgacggcag gactggagag gctgcgtgcc cggcactggc agcgaggctc 72 0 

gtcrtgtcccc caggcagatc tgggcacttt cccaacccag gtttatgcgt ctccagggaa 7 80 

gcctcggtgc cagagtggtg ggcagatctg accatcccca cagaccagaa acaaggaatt 840 

tctcrggatta cccagtcccc cttcaaccca gttgatgtaa ccacctcatt ttttacaaat 900 

acagaatcta ttctactcag gctatgggcc tcgtcctcac tcagttattg cgagtgttgc 9 60 

tgtccgcatg ctccgggccc cacgtggctc ctgtgctcta gatcatgcrtg actcccccgc 102 0 

cctcrtggttg gaatcgatgc cacggattgc aggccaaatt tcagatcgtg tttccaaaca 1080 

cccttgctgt gccctttaat gggattgaaa gcacttttac cacatggsiga aatatatttt 1140 

taatttgtga tgcttttcta caaggtccac tatttctgag tttaafcg-fcgt ttccaacact 12 00 

taaggagact ctaatgaaag ctgatgaatt ttcttttctg tccaaacaag taaaataaaa 12 60 
ataaaagtct atttagatgt tg 12 82 

<210> 344 

<2ia> 60 

<212> DNA 

<213> Homo sapiens 

<30O> 

<308> Contig53598 
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<400> 344 

ccactatttc tgagtttaat gtgtttccaa cacttaagga gactctaatg aaagctgatg 60 

<210> 345 
<211> 601 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig53641 
<400> 345 

tggaggctgt ggatgatgct ttcaagacaa tggat^tgga tatggccgag gaacatgcca 60 

gggcccagat gagggcccag atgaatatcg gggat^aagc gctgattgga cggtggagct 12 0 

gggatgacat acaagtcgag ctcctgacct gggat^agga cggagatttt ggcgatgcct 180 

gggccaggat cccctttgct ttctgggcca gatacoatca gtacattctg aatagcaacc 240 

gtgccaacag gagggccacg tggagagctg gcgtcagcag tggcaccaat ggaggggcca 3 00 

gcaccagcgt cctagatggc cccagcacca gctccaccat ccggaccaga aatgctgcca 36 0 

gagctggcgc cagcttcttc tcctggatcc agcaccgttg acgaactgca gcgatcttac 42 0 

tggccaagcc agagcgcctc ctctcagatt ccttctcgac acagcaccct aggcggcttc 480 

ttcctgtcag tcggaggtgg catgcaagat gaagctctct ttgctcttcc tgctttcatt 540 

ttgtgctttt ccttgtgttt tcatgttttg ggtatcagtg ttacattaaa gttgcaaaat 600 
t 601 



<210> 346 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig53641 



<400> 346 



ctttcatttt gtgcttttcc ttgtgttttc atgttttggg tatcagtgtt acattaaagt 60 

<210> 347 
<211> 751 
<212> DNA 
<213> Homo sapiens 



<300> 

<3 08> Contig54242 



<400> 347 

aattactcaa agaaggagcc atttcagtta 
gcagtgggtc cttccctgtt gaccatttgg 
ctgcaacagg ccttgccaga gggctcagga 
gaggtaattt tacatttccc ctttaaagta 
ccctataaag aatgagtcga actagaccag 
tccaacaggg gtagcaatga cctgtccact 
tggttaaagg cacacgtgag actttcagtg 
gtcaggtatg caggcattgc attgtcagtg 
ttccaagggt gaatctcctg ctctgtgaat 
tgggtctatg tacgttcttc cgaagcactg 
ttgattggtt gagattaact ggtgtggtgg 
ttgtatgtag ttctacataa tgcaaattgt 
gattaatatc aataaaaagg ggatgttgtg 



actcaagtga atgaaagact tttggaatct 60 

taacttgtaa tctgaccaaa aactcttgag 12 0 

tgggaaagga agaaggggat aggaaaagaa 180 

aattttagcc aactcatcat tctgaaatgt 240 

aagccagcct actccttctt acatagcttc 3 00 

tcaaacacag ataaggcctg ccatcctcat 3 60 

ggctctgctg agaaggaagg cagcccagga 42 0 

tctgcfcctca gagtttacac attcaattgc 480 

gctatcagac cccaaaggcc aaccttgggc 540 

atgatcaaaa ttgaagacac attcagaggt 600 

ttggtg-tatg tatgttttat ttttatgtct 660 

gctttotgat ggacaagacc tcataactgt 72 0 
g 752. 



<210> 348 
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<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig54242 
<400> 348 

gtaaatttta gccaactcat cattctgaaa tgtccctata aagaatgagt cgaactagac 6 O 

<210> 349 

<211> 637 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig54661 
<400> 349 

ggcagtgatg tctatgttga gattaactta tgtattgagg aaaatttgaa gtttattttt 6 0 

tcgatgaata aggctgtcaa atgatttagt atagattaat gacatctttt ttagaaatat 12 0 

taaagtgagt attcctcatt atgtcatcat ttctgataat tagagtgcta atttgaatgt 180 

tagataatgt ttccacatct atacctattt ctttctaggg cacttctgac cctggggctt 2 40 

ggggatggcc tttaggccac agtagtgtct gtgttaagtt cactaaatgt gtatttaatg 3O0 

agaaacattc ctatgtaaaa atg-tgtgtat gtgaacgtat gcatacattt ttattgtgca 3 60 

cctgtacatt gtgaagaagt agtttggaaa tttgtaaagc acaaaccata aaagagtgtg 42 0 

gagttattaa atgatgtagc acaaatgtaa tgtttagctt ataaaaggtc ctttctattt 480 

tctatggcaa agactttgac acttgaaaaa taaaaccaat atttgattta tttttgtaag 5 40 

tatttaggat attattttaa ataaatgatt gtccattatc aatataatag ttgtgaaatg 6O0 

atttaagtaa ataaacttta tgcttctgtg tctgttg 637 

<210> 350 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig54661 
<400> 350 

ctgtacattg tgaagaagta gtttggaaat ttgtaaagca caaaccataa aagagtgtgg 60 

<210> 351 
<211> 924 
<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig55188 
<400> 351 

gcgacaagta ccgcaagcgg gcactcatcc tggtgtcact gctggccttt gccggcctct 60 

tcgccgccct cgtgctgtgg ctgtacatct accccattaa ctggccctgg atcgagcacc 12 0 

tcacctgctt ccccttcacc agccgcttct gcgagaagta tgagctggac caggtgctgc 18 0 

actgaccgct gggccacacg gctgcccctc agccctgctg gaacagggtc tgcctgcgag 24 0 

ggctgccctc tgcagagcgc tctotgtgtg ccagagagcc agagacccaa gacagggccc 30 0 

gggctctgga cctgggtgcc cccctgccag gcgaggctga ctccgcgtga gatggttggt 36 0 

taaggcgggg tttttctggg gcgtgaggcc tgtgagatcc tgacccaagc tcaggcacac 42 0 

ccaaggcacc tgcctctctg agtcttgggt ctcagttcct aatatcccgc tccttgctga 48 0 

gaccatctcc tggggcaggg tccttttctt cccaggtcct cagcgctgcc tctgctggtg 54 0 
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ccttctcccc cactactact ggagcgtgcc cttgctgggg acgtggctgt gccctcagtt 600 

gcccccaggg ctgggtgccc accatgcccc ttcctctttc tcctcctacc tctgccctgt 660 

gagcccatcc ataaggctct cagatgggac attgtgggaa aggctttggc catcrgtctgg 72 0 

gggcagagaa ca.oigggggga gacacaagta gacctcaggt agaacgacac tgg-g-cggagc 7 80 

caccccaggg cctgctccca gggagtgctc gaggcgcatc aggcccgttt tttaccagtt 840 

tatatcacgg tcttcatttt taaaagtaac gctaactttg tacggacgat gtctcatgga 900 
ttaaataata ttctttatgg cagt 924 



<210> 352 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig5 5188 



<400> 352 

agtaacgcta actttgtacg gacgatgtct catggattaa ataatattct tta.tggcagt 60 

<210> 353 
<211> 699 
<212> DNA 

<213> Homo sapiens 



<300> 

<308> ContigB 5353 



<400> 353 

tgattatgcc aagagctcta aacagaagtt tgagaaggta aaaattaagt tgtagtatct 60 

gagttgtttt tattttcttc ctttggtgtt tatgaaggta ttcataagaa ctttaatttc 12 0 

aggggaaaaa atgcctgatt tgctattttt gacatttcct cgtctcttaa gaagtcagtt 180 

aaatatgttt tcatagttta tattcctgtt tcatagatta ctgtgaaaca tgtatttaaa 240 

cctatgaatt ataaaatagt atttagattc tagcgtgagt taaatagatt agtcatatat 3 00 

cttttagatt tg-"tggatttg acatgtaaat tatgtgttgt gtataagtaa gttagttact 3 60 

aaacatatgg catggttatt gataaacttg ttgctatttt tttccaaatg cta_tcagtgt 42 0 

ttgtggactt ttaaaaatta gtttgaattt tggaatgttc tgtgataaaa tataatttca 480 

actattttgt acatttaaat atgccatgtt gtatatgtct gtatttaaaa atgttgtaaa 540 

tatctgcatt ttaagaatta tgaaagattt tcctcaaaaa tgacagaact ctccatactt 600 

aattgtgaca cattataaga tatctgattt taagcttttg gattttgttc taaaaattaa 660 

gtttaaacat gctgaaaatt ccataaaaat aaaattttg 699 



<210> 354 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig5 5353 



<400> 354 

taaaatagta tttagattct agcgtgagtt aaatagatta gtcatatatc ttt tagattt 60 



<210> 355 
<211> 809 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig5 6503 



<400> 355 
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gcatgt^aga 
agggaatcct 
tgggcg-^aca 
tgcccctttg 
agcagcccct 
ttttctcctt 
agcccctttg 
agaaagctta 
ggccccaggg 
tttaccagca 
actttccatc 
gaccccttgt 
ggtggcctgc 
atgtgcttaa 



tgagtgactg 
gtcaccatca 
gcatggagct 
cccgcctcct 
cggaggcaga 
gccttcgcat 
ccatagcctg 
tggctgggtc 
caggcagggg 
gccctggcat 
ccaaaggcag 
ccctctttga 
gctagcttct 
taaatcaatt 



ccggtgaatg 
ataatcactt 
ctccatggcc 
gcaaacctca 
ggaaggaaaa 
ggctggcctt 
attttgggga 
tggtttcttc 
ccacactatg 
gttcctgccc 
tctccgtggt 
gggaggggag 
tttgatactg 
ccaagcctc 



tgtccacagt 
gtggagcgcc 
aggctgcctg 
cagggtcccc 
tggggatggc 
cccctccaaa 
ggaggaaggg 
ccttcccaga 
cctgcgccct 
cacaggaata 
tgaagcagac 
ctatgctagg 
aaaactttta 
809 



tga^aggttg 
act ctgccca 
tgt^catgtt 
acacaacagt 
tggg-gctctc 
acctccattc 
gcgatttgag 
ggg-fccttact 
ggtaaaggtg 
gaa-tggaggg 
tgg<atttttg 
actccaacct 
aggtgggagg 



gagcaggatg 
agacgccacc 
ccctgtctgg 
gccctccaga 
tccatcctcc 
ccctgctgcc 
ggagaagggg 
gttccagggt 
acccctgcca 
agctccagaa 
ctctgcccct 
cagggactcg 
gtggcaaggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 



<210> 356 

<211> 60 

<212> 3DNA 

<213> Homo sapiens 

<300> 

<308> Contig56503 



<400> 356 

gaaaactttt aaggtgggag ggtggcaagg gatgtgctta ataaatcaat tccaagcctc 

<210> 357 
<211> 976 
<212> DNA 
<213> Homo sapiens 



60 



<300> 

<308> Contig56678 



<400> 357 

gaaggatata 

gaaatatgta 

tttagaattgt 

tttgtgtgaa 

tgatgtcttt 

tgaggtttaa 

ctgaacctga 

ccctcatcca 

gctttgtccc 

gggtgag-gca 

tgctctgaca 

gaactgcaca 

acagtgttaa 

tggtctcjttt 

attgtaa.aga 

ctggccttat 

aataaactgg 



ctttgttata 
ttctgaactc 
gttaatttta 
agcatgtgat 
ttctcagtcc 
gatcaaatat 
ttgctacttt 
cagacatttg 
cgttgttaga 
ggtggctggc 
cttccactgt 
gccaggagca 
attattacat 
tctcttcagc 
tacaataaac 
tcgcgttctg 
ttgaaa 976 



acttattatt 
tatctgcaga 
aagggacctg 
catactaacc 
atagttacaa 
aaattactct 
ttcatcttaa 
gagaaggaaa 
ctggcagcgt 
ggggacaggg 
ccctgctggg 
aggaacccac 
aagcaggtga 
aaaaatgact 
cggttgaaat 
ttctcctgca 



ttgttctctg 
atgagtcact 
ataggtattt 
cagcctcctg 
ttgtttagta 
gcttttcgac 
atattatatt 
tgggagggtg 
cagttgctcg 
agaggctgag 
gatgctgggg 
taaatactcc 
aaggtagaag 
atttttgtgt 
atctgctttg 
aatagcgccc 



taaatacaag 
acaccaaaat 
atttacatat 
gaatgtcgct 
tgctaatcag 
tcattcaggt 
tcctcatcta 
tctgttatcc 
gtgcrgcttgg 
aggcjaagtgg 
ccaaggcctg 
gtcacctcca 
gcgaattatg 
gtgactaatt 
ttgacaagcg 
tctaaaaaga 



atgtttatag 
agttctatta 
gcgatccaca 
gtacgatgat 
tccagttccc 
agcattgtac 
atctgccttc 
ctttctcttt 
ttagagccgt 
tggcatttac 
tggggcctgt 
tgtcccctct 
tgagtaaata 
tatttttatt 
tgtgctttct 
agagtcagac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 



<210> 358 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig56678 
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<400> 358 

tattacataa gcaggtgaaa ggtagaaggc 



gaattatgtg agtaaatatg gtctgttttc 60 



<210> 359 
<211> 1118 
<212> DNA 

<213> Homo sapiens 
<300> 

<308> Contig57584 



<400> 359 

agctgttgtg 

cccttgcccc 

caccagcgcg 

tagtgctggc 

agcaggggag 

tcttggtatt 



catccagagg 
caccctgcaa 
gcctccgccg 
atcctgcgca 
caactggagg 
gcacggacac 



tggaattggg- 
ctcctggttg- 
acaacaagca. 
ctcccatcca 
gtcttaaaca 
ctatgaagac 



gcccggcatt 
agatgggctc 
tctggctcga 
ggtggagagc 
tgcccaggac 
cagcagtgga 



ccctcctcgt 
agccaagagc 
gtggcggacc 
tctccacagc 
tcagatcccc 
gaccccccaa 



cccgggctgg 60 

gtcccagtca 12 0 

cccgttcacc 180 

caggcctacc 2 40 

gctctcctac 300 

gcccactggt 3 60 



gaaacagctg agtgaagtat ttgaaactga. agactctaaa tcaaatcttc ccccagagcc 42 0 

tgttctgccc ccagaggcac ctttatcttc tgaattggac ttgcctctgg gtacccagtt 480 

atctgttgag gaacagatgc caccttggaa ccagactgag ttcccctcca aacaggtgtt 540 

ttccaaggag gaagcaagac agcccacaga aacccctgtg gccagccaga gctccgacaa 60 0 

gccctcaagg gaccctgaga ctcccagatc ttcaggttct atgcgcaata gatggaaacc 660 

aaacagcagc aaggtactag ggagatcccc cctcaccatc ctgcaggatg acaactcccc 72 0 

tggcaccctg acactacgac agggtaagcg- gccttcaccc ctaagtgaaa atgttagtga 7 80 

actaaaggaa ggagccattc ttggaactgg- acgacttctg aaaactggag gacgagcatg 840 

ggagcaaggc caggaccatg acaaggaaaa tcagcacttt cccttggtgg agagctaggc 9 00 

cctgcatggc cccagcaatg cagtcaccca gggcctggtg atatctgtgt cctctcaccc 9 60 

cttctttccc agggatactg aggaatggct tgttttctta gactcctcct cagctaccaa 102 0 

actgggactc acagctttat tgggctttct ttgtgtcttg tgtgtttctt ttatattaaa 1080 

ggaagtaatt ttaaatgtta ctttaaaaag- gtatatgt 1118 



<210> 360 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig57584 
<400> 360 

aggaatggct tgttttctta gactcctcct 



cagctaccaa actgggactc acagctttat 60 



<210> 361 
<211> 859 
<212> DNA 
<213> Homo sapiens 

<300> 

<308> Contig63649 



<400> 361 

gtcgcagggt accagtgtgc ggagttcctg- 
acaggtgtgg tcatatcttc agccaacagg 
ttcctacggg aagagagtga cagatttggt 
gccgctgcgg gagacagcgg gtggatcctc 
aagtctactg ttaaaatcag gaccgggtcg" 
cccaggcctg caggagttga gggctgcacc 
tggacgcctg gcatggactc ggactggcct 
gtcaccactg cgaagtgagg cttggaggac 



ttgccaagct gaaggtggcc ctgggcaggc 60 

accatcctcc ggagggccac ctctggggac 12 0 

gcttctgtgt gtttctgccg cttcagtggg 180 

cagcagcctg tctgctgagc ctgccttctc 240 

tgtccgagcc tacaggccct gtctccgctc 3 00 

tgctcgctgg agagggagag gcagatttag 3 60 

ttggaagctc cctgccctga cgggttgcct 420 

ctgcacctga gaaaggctgt gtgtggtctt 480 
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gggtccacac 
agtgaaccgg 
aactctgggc 
ccgtacttat 
cacgccctgc 
gttcacaata 
aaaaaaaaaa 



ctgccagagc 
ggtgtcctca 
gaggagggtc 
accaggtggg 
tgtggaactg 
tagttttcaa 
aaaaaaaaa 



taacttactg 
gctggcccta 
ccagcgccgc 
atccctgccc 
gatgttttca 
agtacagttt 
859 



ccagacggcg 
cagagcactt 
ccctcgatac 
tgcattgcct 
gggagcccag 
aaaactcaaa 



acttactgtg 
ctgtgctggg 
agccctgctc 
ggggattggc 
cctttcctca 
agtaaacttt 



ggccaccctc 540 

gatgagtag-g 6 00 

tgccctctgc 660 

tgggcttgg-g 72 0 

tgtcaacaca 7 80 

tcagcaactc 840 



<210> 362 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig63649 



<400> 362 

cagcctttcc tcatgtcaac acagttcaca atatagtttt caaagtacag tttaaaactc 60 



<210> 363 

<211> 1170 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig63525 



<400> 363 

gccatggctc cctgggcgga gcgagcactc gcggctgaac ccgctgcgcg cggtgtggct 60 

cacgctgacc gccgccttcc tgctgaccct actgctgcag ctcctgccgc ccggcctgct 120 

cccgggctgc gcgatcttcc aggacctgat ccgctatggg aaaaccaagt gtggggagcc 180 

gtcgcgcccc gccgcctgcc gagcctttga tgtccccaag agatattttt cccactttta 240 

tatcatctca gtgctgtgga atggcttcct gctttggtgc cttactcaat ctctgttcct 3 00 

gggagcacct tttccaagct gg-cttcatgg tttgctcaga attctcgggg cggcacagtt 3 60 

ccagggaggg gagctggcac tg-tctgcatt cttagtgcta gtatttctgt ggctgcacag 42 0 

cttacgaaga ctcttcgagt gcctctacgt cagtgtcttc tccaatgtca tgattcacgt 480 

cgtgcagtac tgttttggac tfcgtctatta tgtccttgtt ggcctaactg tgctgagcca 540 

agtgccaatg gatggcagga atgctacata acagggaaaa atctattgat gcaagcacgg 600 

tggttccata ttcttgggat gatgatgttc atctggtcat ctgcccatca gtataagtgc 660 

catgttattc tcggcaatct caggaaaaat aaagcaggag tggtcattca ctgtaaccac 72 0 

aggatcccat ttggagactg gtttgaatat gtttcttccc ctaactactt agcagagctg 7 80 

atgatctacg tttccatggc cgtcaccttt gggttccaca acttaacttg gtggctag-bg 840 

gtgacaaatg tcttctttaa tcaggccctg tctgcctttc tcagccacca attctacaaa 900 

agcaaatttg tctcttaccc gaagcatagg aaagctttcc taccattttt gttttaag-bt 960 

aacctcagtc atgaagaatg caaaccaggt gatggtttca atgcctaagg acagtgaagt 102 0 

ctggagccca aagtacagtt tcagcaaagc tgtttgaaac tctccattcc atttctatac 1080 

cccacaagtt ttcactgaat gagcatgcag tgccactcaa gaaaatgaat ctccaaagta 1140 
tcttcaaaga attaattact aatggcagat 117 0 



<210> 364 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<300> 

<308> Contig63525 



<400> 364 

ctcttacccg aagcatagga aagctttcct accatttttg ttttaagtta acctcagtca 60 



<210> 365 
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<211> 632 
<212> DNA. 

<213> Homo sapiens 



<300> 

<308> Con_tig64688 



<400> 365 

aagaatgcta agatgatttc agatatcgaa 
gatgaactQ-c ttcggttaga gccacagctg 
aaagagagaa agtcttccct taggaatgca 
tatcaagatt attcagatgt tcaagctcaa 
tccagccttc cagctctgtt atttaaagca 
cgaaatatoa accatcagtt agagaagctc 
aaaatgtgcc tataggaaga ctagtctcat 
aatcaatt^t tttgcaaaga agttatggcc 
taaatggctg tgaacaatgt taaatagcat 
tcatcttttc tggttaatat cttgagtaat 
ggtatgagcc ataataaact tgtaaaatta 



aagaaaaggc agcgtatgat tgaagtccag 60 

aaacaactac aaacaaaata tgatgaactt 12 0 

gcatatttct tatctaattt aaaacagctt 180 

gaaccaaacg taaaggaaac gtatgattca 240 

agaacacttc tgggagccga aagccatctg 3 00 

cttgaccagg gatgagaaga gcagtctact 3 60 

gctgttacct tctgaaactg tacctttata 42 0 

tacttagaat ctaaaatttg ttattcaaat 480 

cagtttgtcc aatagtttta aaggccataa 540 

tttaaaatgt tgacacctta atcggtccca 600 
ag 632 



<210> 366 

<211> 60 

<212> DNA 

<213> Homo sapiens 



<300> 

<308> Contig64688 



<400> 36 6 , cn 

ggctgtga.ac aatgttaaat agcatcagtt tgtccaatag ttttaaaggc cataatcatc 60 
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